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ELITRA.001A PATENT 
GENES IDENTIFIED AS REQUIRED FOR PROLIFERATION IN 

ESCHERICHIA COLI 
RELATED APPLICATIONS 
5 This application claims priority from U.S. Provisional Patent Application Serial 

Number 60/117,405 filed January 27, 1999, the disclosure of which is incorporated 
herein by reference in its entirety. 

BACKGROUND OF THE INVENTION 
Since the discovery of penicillin, the use of antibiotics to treat the ravages of 
10 bacterial infections has saved millions of lives. With the advent of these "miracle drugs," 
for a time it was popularly believed that humanity might, once and for all, be saved from 
the scourge of bacterial infections. In fact, during the 1980s and early 1990s, many large 
pharmaceutical companies cut back or eliminated antibiotics research and development. 
They believed that infectious disease caused by bacteria finally had been conquered and 
1 5 that markets for new drugs were limited. Unfortunately, this belief was overly optimistic. 

The tide is begmning to turn in favor of the bacteria as reports of drug resistant 
bacteria become more frequent. The United States Centers for Disease Control aimounced 
that one of the most powerful known antibiotics, vancomycin, was unable to treat an 
infection of the common Staphylococcus aureus (staph). This organism is commonly 
20 found in our environment and is responsible for many nosocomial infections. The 
import of this announcement becomes clear when one considers that vancomycin was 
used for years to treat infections caused by stubborn strains of bacteria, like staph. In 
short, the bacteria are becoming resistant to our most powerful antibiotics. If this trend 
continues, it is conceivable that we will return to a time when what are presently 
25 considered minor bacterial infections are fatal diseases. 

There are a number of causes for the predicament in which practitioners of medical 
arts find themselves. Over-prescription and improper prescription habits by some 
physicians have caused an indiscriminate increase in the availability of antibiotics to the 
pubUc. The patient is also partly responsible, for even in instances where an antibiotic is 
30 the appropriate treatment, patients will often improperly use the drug, the result being yet 
another population of bacteria that is resistant, in whole or in part, to traditional antibiotics. 
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The bacterial scourges that have haunted humanity remain, in spite of the 
development of modem scientific practices to deal with the diseases that they cause. Drug 
resistant bacteria are now advancing on the health of humanity. A new generation of 
antibiotics to once again deal with the pending health threat that bacteria present is 
5 required. 

Discovery^ of New Antibiotics 
As more and more bacterial strains become resistant to the panel of available 
antibiotics, new compounds are required. In the past, practitioners of pharmacology 
would have to rely upon traditional methods of drug discovery to generate novel, safe 

10 and efficacious compounds for the treatment of disease. Traditional drug discovery 
methods involve blindly testing potential drug candidate-molecules, often selected at 
random, in the hope that one might prove to be an effective treatment for some disease. 
The process is painstaking and laborious, with no guarantee of success. Today, the 
average cost to discover and develop a new drug is nearly US $500 million, and the 

15 average time is 15 years from laboratory to patient. Improving this process, even 

incrementally, would represent a huge advance in the generation of novel antimicrobial 
agents. 

Newly emerging practices in drug discovery utilize a number of biochemical 
techniques to provide for directed approaches to creating new drugs, rather than 

20 discovering them at random. For example, gene sequences and proteins encoded 

thereby that are required for the proliferation of an organism make for excellent targets 
since exposure of bacteria to compounds active against these targets would result in the 
inactivation of the organism. Once a target is identified, biochemical analysis of that 
target can be used to discover or to design molecules that interact with and alter the 

25 functions of the target. Using physical and computational techniques, to analyze 

structural and biochemical targets in order to derive compounds that interact with a 
target is called rational drug design and offers great future potential. Thus, emerging 
drug discovery practices use molecular modeling techniques, combinatorial chemistry 
approaches, and other means to produce and screen and/or design large numbers of 

30 candidate compounds. 

Nevertheless, while this approach to drug discovery is clearly the way of the 
future, problems remain. For example, the initial step of identifying molecular targets 
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for investigation can be an extremely time consuming task. It may also be difficult to 
design molecules that interact with the target by using computer modeling techniques. 
Furthermore, in cases where the function of the target is not known or is poorly 
understood, it may be difficult to design assays to detect molecules that interact with 
and alter the functions of the target. To improve the rate of novel drug discovery and 
development, methods of identifying important molecular targets in pathogenic 
microorganisms and methods for identifying molecules that interact with and alter the 
functions of such molecular targets are urgently required. 

Escherichia coli represents an excellent model system to understand bacterial 
biochemistry and physiology. The estimated 4288 geines scattered along the 4.6 x 10^ 
base pairs of the Escherichia coli (E. coli) chromosome offer tremendous promise for 
the understanding of bacterial biochemical processes. In turn, this knowledge will assist 
in the development of new tools for the diagnosis and treatment of bacteria-caused 
human disease. The entire E. coli genome has been sequenced, and this body of 
information holds a tremendous potential for application to the discovery and 
development of new antibiotic compounds. Yet, in spite of this accomplishment, the 
general functions or roles of many of these genes are still unknown. For example, the 
total number of proliferation-required genes contained within the E. coli genome is 
unknown, but has been variously estimated at around 200 to 700 (Armstrong, K.A. and 
Fan, D.P. Essential Genes in the metB-malB Region of Escherichia coli K12, 1975, J. 
Bacterid. 126:48-55). 

Novel, safe and effective antimicrobial compounds are needed in view of the 
rapid rise of antibiotic resistant microorganisms. However, prior to this invention, the 
characterization of even a single bacterial gene was a painstaking process, requiring 
years of effort. Accordingly, there is an urgent need for more novel methods to identify 
and characterize bacterial genomic sequences that encode gene products required for 
proliferation and for methods to identify molecules that interact with and alter the 
functions of such genes and gene products. 

SUMMARY OF THE INVENTION 



One embodiment of the present invention is a purified or isolated nucleic acid 
sequence consisting essentially of one of SEQ ID NOs: 1-81, 405-485, wherein said 



nucleic acid inhibits microorganism proliferation. The nucleic acid sequence may be 
complementary to at least a portion of a coding sequence of a gene whose expression is 
required for microorganism proliferation. The nucleic acid sequence may comprise a 
fragment of one of SEQ ID NOs. 1-81, 405-485, said fragment selected from the group 
5 consisting of fragments comprising at least 10, at least 20, at least 25, at least 30, at 
least 50 or more than 50 consecutive bases of one of SEQ ID NOs: 1-81, 405-485. The 
nucleic acid sequence may be complementary to a coding sequence of a gene whose 
expression is required for microorganism proliferation. 

Another embodiment of the present invention is a vector comprising a promoter 

10 operably linked to a nucleic acid comprising a sequence selected from the group 
consisting of SEQ ID NOs, 1-81, 405-485. The promoter may be active in an organism 
selected from the group consisting of Escherichia coll, Staphylococcus aureus, 
Pseudomonas aeruginosa, Enterobacter cloacae, Helicobacter pylori, Neisseria 
gonorrhoeae, Enterococcus faecalis, Streptococcus pneumoniae, Haemophilus 

15 influenzae, Salmonella typhimurium, Saccharomyces cerevisiae, Candida albicans, 

Cryptococcus neoformans, Aspergillus fumigatus, Klebsiella pneumoniae. Salmonella 
typhi, Salmonella paratyphi, Salmonella cholerasuis, Staphylococcus epidermidis, 
Mycobacterium tuberculosis, Mycobacterium leprae, Treponema pallidum, Bacillus 
anthracis, Yersinia pestis, Clostridium botulinum, Campylobacter jejuni, Chlamydia 

20 trachomatus, Chlamydia pneumoniae or any species falling within the genera of any of 

the above species. 

Another embodiment of the present invention is a host cell containing the 
vectors described above. 

Another embodiment of the present invention is a purified or isolated nucleic 
25 acid consisting essentially of the coding sequence of one of SEQ ID NOs: 82-88, 90- 
242. One aspect of this embodiment is a fragment of the nucleic acid comprising at 
least 10, at least 20, at least 25, at least 30, at least 50 or more than 50 consecutive bases 
of one of SEQ ID NOs: 82-88, 90-242. 

Another embodiment of the present invention isa vector comprising a promoter 
30 operably linked to the nucleic acids of the preceding embodiment. 

Another aspect of the present invention is a purified or isolated nucleic acid 
comprising a nucleic acid sequence complementary to at least a portion of an intragenic 
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sequence, intergenic sequence, sequences spanning at least a portion of two or more 
genes, 5' noncoding region, or 3' noncoding region within an operon encoding a 
polypeptide comprising a sequence selected from the group consisting of SEQ ID NOs: 
243-357, 359-398. 

Another embodiment of the present invention is a purified or isolated nucleic 
acid comprising a nucleic acid having at least 70% homology to a sequence selected 
from the group consisting of SEQ ID NOs 1-81, 405-485, 82-88, 90-242 or the 
sequences complementary thereto as determined using BLASTN version 2.0 with the 
default parameters. The nucleic acid may be from an organism selected from the group 
consisting of Staphylococcus aureus, Pseudomonas aeruginosa, Enterobacter cloacae, 
Helicobacter pylori, Neisseria gonorrhoeae, Enterococcus faecalis, Streptococcus 
pneumoniae, Haemophilus influenzae, Salmonella typhimurium, Saccharomyces 
cerevisiae, Candida albicans, Cryptococcus neoformans, Aspergillus fumigatus, 
Klebsiella pneumoniae, Salmonella typhi, Salmonella paratyphi, Salmonella 
cholerasuis, Staphylococcus epidermidis, Mycobacterium tuberculosis, Mycobacterium 
leprae, Treponema pallidum, Bacillus anthracis, Yersinia pestis, Clostridium 
botulinum, Campylobacter jejuni, and Chlamydia trachomatus, Chlamydia pneumoniae 
or any species falling within the genera of any of the above species. 

Another embodiment of the present invention is a purified or isolated nucleic 
acid consisting essentially of a nucleic acid encoding a polypeptide having a sequence 
selected from the group consisting of SEQ ID NOs.: 243-357, 359-398. 

Another embodiment of the present invention is a vector comprising a promoter 
operably linked to a nucleic acid encoding a polypeptide having a sequence selected 
from the group consisting of SEQ ID NOs.: 243-357, 359-398. 

Another embodiment of the present invention is a host cell containing the vector 
of the preceding embodiment. 

Another embodiment of the present invention is purified or isolated polypeptide 
comprising the sequence of one of SEQ ID NOs: 243-357, 359-398. 

Another embodiment of the present invention is purified or isolated polypeptide 
comprising a fragment of one of the polypeptides of SEQ ID NOs. 243-357, 359-398, 
said fragment selected from the group consisting of fragments comprising at least 5, at 
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least 10, at least 20, at least 30, at least 40, at least 50, at least 60 or more than 60 
consecutive amino acids of one of the polypeptides of SEQ ID NOs.: 243-357, 359-398. 

Another embodiment of the present invention is an antibody capable of 
specifically binding the polypeptide of the preceding embodiment. 

Another embodiment of the present invention is method of producing a 
polypeptide, comprising introducing a vector comprising a promoter operably linked to 
a nucleic acid encoding a polypeptide having a sequence selected from the group 
consisting of SEQ ID NOs. 243-357, 359-398into a cell. The method may further 
comprise the step of isolating said protein. 

Another embodiment of the present invention is a method of inhibiting 
proliferation comprising inhibiting the activity or reducing the amount of a polypeptide 
having a sequence selected from the group consisting of SEQ ID NOs. 243-357, 359- 
398or inhibiting the activity or reducing the amount of a nucleic acid encoding said 
polypeptide. 

Another embodiment of the present invention is method for identifying 
compounds v^^hich influence the activity of a polypeptide required for proliferation 
comprising: 

contacting a polypeptide comprising a sequence selected from the group 
consisting of 243-357, 359-398with a candidate compound; and 

determining whether said compound influences the activity of said polypeptide. 

The activity may be an enzymatic activity. The activity may be a carbon 
compound catabolism activity. The activity may be a biosynthetic activity. The activity 
may be a transporter activity. The activity may be a transcriptional activity. The 
activity may be a DNA replication activity. The activity may be a cell division activity. 

Another embodiment of the present invention is a compound identified using the 
above method. 

Another embodiment of the present invention is method for assaying compounds 
for the ability to reduce the activity or level of a polypeptide required for proliferation, 
comprising: 

providing a target, wherein said target comprises the coding sequence of a 
sequence selected from the group consisting of SEQ ID NOs. 82-88, 90-242; 
contacting said target with a candidate compound; and 
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measuring an activity of said target. 

The target may be a messenger RNA molecule transcribed from a coding region 
of one of SEQ ID. NOs.: 82-88, 90-242 and said activity is translation of said 
messenger RNA. The target may be a coding region of one of SEQ ID. NOs. 82-88, 90- 
5 242 and said activity is transcription of said messenger RNA. 

Another embodiment of the present invention is a compound identified using the 
method above. 

Another embodiment of the present invention is a method for identifying 
compounds which reduce the activity or level of a gene product required for ceil 

1 0 proliferation comprising the steps of: 

expressing an antisense nucleic acid against a nucleic acid encoding said gene 
product in a cell to reduce the activity or amount of said gene product in said cell, 
thereby producing a sensitized cell; 

contacting said sensitized cell with a compound; and 

1 ^ determining whether said compound inhibits the grovrth of said sensitized cell to 

a greater extent than said compound inhibits the growth of a nonsensitized cell. 

The cell may be selected from the group consisting of bacterial cells, fungal 
cells, plant cells, and animal cells. The cell may be an E. coli cell. The cell may be 
from an organism selected from the group consisting of Staphylococcus aureus, 

20 Pseudomonas aeruginosa, Enterobacter cloacae, Helicobacter pylori. Neisseria 

gonorrhoeae, Enterococcus faecalis. Streptococcus pneumoniae, Haemophilus 
influenzae. Salmonella typhimurium, Saccharomyces cerevisiae, Candida albicans, 
Cryptococcus neoformans, Aspergillus fumigatus, Klebsiella pneumoniae. Salmonella 
typhi, Salmonella paratyphi, Salmonella cholerasuis, Staphylococcus epidermidis, 

15 Mycobacterium tuberculosis, Mycobacterium leprae, Treponema pallidum, Bacillus 
anthracis, Yersinia pestis, Clostridium botulinum, Campylobacter jejuni, and Chlamydia 
trachomatus, Chlamydia pneumoniae or any species falling within the genera of any of 
the above species. The antisense nucleic acid may be transcribed from an inducible 
promoter. The method may, further comprise the step of contacting said cell with a 

50 concentration of inducer which induces said antisense nucleic acid to a sublethal level. 
The sub-lethal concentration of said inducer may be such that growth inhibition is 8% 
or more. The inducer may be isopropyl-l-thio-p-D-galactoside. The growth inhibition 
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may be measured by monitoring optical density of a culture growth solution. The gene 
product may be a polypeptide. The gene product may be an RNA. The gene product 
may comprise a polypeptide having a sequence selected from the group consisting of 
SEQ ID NOs.: 243-357, 359-398. 

5 Another embodiment of the present invention is a compound identified using the 

method above. 

Another embodiment of the present invention is a method for inhibiting cellular 
proliferation comprising introducing a compound with activity against a gene 
corresponding to one of SEQ ID NOs.: 82-88, 90-242 or with activity against the 

10 product of said gene into a population of cells expressing a gene. The compound may 
be an antisense oligonucleotide comprising a sequence selected from the group 
consisting of SEQ ID NOs.: 1-81, 405-485, or a proliferation-inhibiting portion thereof 
The proliferation inhibiting portion of one of SEQ ID NOs. 1-81, 405-485 may be a 
fragment comprising at least 10, at least 20, at least 25, at least 30, at least 50 or more 

15 than 50 consecutive bases of one of SEQ ID NOs: 1-81, 405-485. The compound may 
be a triple helix oligonucleotide. 

Another embodiment of the present invention is a preparation comprising an 
effective concentration of an antisense oligonucleotide comprising a sequence selected 
from the group consisting of SEQ ID NOs.: 1-81, 405-485, or a proliferation-mhibiting 

20 portion thereof in a pharmaceutically acceptable carrier. The proliferation-inhibiting 
portion of one of SEQ ID NOs. 1-81, 405-485 may comprise at least 10, at least 20, at 
least 25, at least 30, at least 50 or more than 50 consecutive bases of one of SEQ ID 
NOs: 1-81,405-485. 

Another embodiment of the present invention is a method for inhibiting the 

25 expression of a gene in an operon required for proliferation comprising contacting a cell 

in a cell population with an antisense nucleic acid, said cell expressing a gene 
corresponding to one of SEQ ID NOs.: 82-88, 90-242, wherein said antisense nucleic 
acid comprises at least a proliferation-inhibiting portion of said operon in an antisense 
orientation that is effective in inhibiting expression of said gene. The antisense nucleic 

30 acid may be complementary to a sequence of a gene comprising one or more of SEQ ID 
NOs.: 82-88, 90-242. The antisense nucleic acid may be a sequence of one of SEQ ID 
NOs.: 1-81, 405-485, or a portion thereof The cell may be contacted with said 
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antisense nucleic acid by introducing a plasmid which expresses said antisense nucleic 
acid into said cell population. The cell may be contacted with said antisense nucleic 
acid by introducing a phage which expresses said antisense nucleic acid into said cell 
population. The cell may be contacted with said antisense nucleic acid by introducing a 
5 sequence encoding said antisense nucleic acid into the chromosome of said cell into 

said cell population. The cell may be contacted with said antisense nucleic acid by 
introducing a retron which expresses said antisense nucleic acid into said cell 
population. The cell may be contacted with said antisense nucleic acid by introducing a 
ribozyme into said cell-population, wherein a binding portion of said ribozyme is 

10 complementary to said antisense oligonucleotide. The cell may be contacted with said 
antisense nucleic acid by introducing a liposome comprising said antisense 
oligonucleotide into said cell. The cell may be contacted with said antisense nucleic 
acid by electroporation. The antisense nucleic acid may be a fragment comprising at 
least 10, at least 20, at least 25, at least 30, at least 50 or more than 50 consecutive bases 

15 of one of SEQ ID NOs: 82-88, 90-242. The antisense nucleic acid may be an 

oligonucleotide. 

Another embodiment of the present invention is a method for identifying 
bacterial strains comprising the steps of: 

providing a sample containing a bacterial species; and 
20 identifying a bacterial species using a species specific probe having a sequence 

selected from the group consisting of SEQ ID NOs. 1-81, 405-485, 82-88, 90-242. 

Another embodiment of the present invention is a method for identifying a gene 
in a microorganism required for proliferation comprising: 

(a) identifying an inhibitory nucleic acid which inhibits the activity of a gene or 
25 gene product required for proliferation in a first microorganism; 

(b) contacting a second microorganism with said inhibitory nucleic acid; 

(c) determining whether said inhibitory nucleic acid from said first microorganism 
inhibits proliferation of said second microorganism; and 

(d) identifying the gene in said second microorganism which is inhibited by said 
30 inhibitory nucleic acid. 

Another embodiment of the present invention is a method for assaying a 
compound for the ability to inhibit proliferation of a microorganism comprising: 
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(a) identifying a gene or gene product required for proliferation in a first 
microorganism; 

(b) identifying a homolog of said gene or gene product in a second microorganism; 

(c) identifying an inhibitory nucleic acid sequence which inhibits the activity of said 
5 homolog in said second microorgansim; 

(d) contacting said second microorganism with a proliferation-inhibiting amount of 
said inhibitory nucleic acid, thus sensitizing said second microorganism; 

(e) contacting the sensitized microorganism of step (d) with a compound; and 

(f) determining whether said compound inhibits proliferation of said sensitized 
0 microorganism to a greater extent than said compound inhibits proliferation of a 

nonsensitized microorganism. 

The step of identifying a gene involved in proliferation in a first microorganism may 
comprise: 

introducing a nucleic acid comprising a random genomic fragment from said 
5 first microorganism operably linked to a promoter wherein said random genomic 

firagment is in the antisense orientation; and 

comparing the proliferation of said first microorganism transcribing a first level 
of said random genomic fragment to the proliferation of said first microorganism 
transcribing a lower level of said random genomic fragment, wherein a difference in 
0 proliferation indicates that said random genomic fragment comprises a gene involved in 
proliferation. 

The step of identifying a homolog of said gene in a second microorganism may 
comprise identifying a homologous nucleic acid or a nucleic acid encoding a 
homologous polypeptide in a database using an algorithm selected from the group 

5 consisting of BLASTN version 2.0 with the default parameters and FASTA version 

3.0t78 algorithm with the default parameters. The step of identifying a homolog of said 
gene in a second microorganism may comprise identifying a homologous nucleic acid 
or a nucleic acid encoding a homologous polypeptide by identifying nucleic acids which 
hybridize to said first gene. The step of identifying a homolog of said gene in a second 

0 microorganism may comprise expressing a nucleic acid which inhibits the proliferation 
of said first microorganism in said second microorganism. The inhibitory nucleic acid 
may be an antisense nucleic acid. The inhibitory nucleic acid may comprise an 
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antisense nucleic acid to a portion of said homolog. The inhibitory nucleic acid may 
comprise an antisense nucleic acid to a portion of the operon encoding said homolog. 
The step of contacting the second microorganism with a proliferation-inhibiting amount 
of said nucleic acid sequence may comprise directly contacting said second 
5 microorganism with said nucleic acid. The step of contacting the second 

microorganism with a proliferation-inhibiting amount of said nucleic acid sequence may 
comprise expressing an antisense nucleic acid to said homolog in said second 
microorganism. 

Another embodiment of the present invention is a compound identified using the 
10 method above. 

Another embodiment of the present invention is a method of assaying a 
compound for the ability to inhibit proliferation comprising: 

(a) identifying an inhibitory nucleic acid sequence which inhibits the activity of a 
gene or gene product required for proliferation in a first microorgansim; 
15 (b) contacting a second microorganism v^th a proliferation-inhibiting amount of 

said inhibitory nucleic acid, thus sensitizing said second microorganism; 

(c) contacting the proliferation-inhibited microorganism of step (b) with a 
compound; and 

(d) determining whether said compound inhibits proliferation of said sensitized 
20 second microorganism to a greater extent than said compound inhibits proliferation of a 

nonsensitized second microorganism. 

The inhibitory nucleic acid may be an antisense nucleic acid which inhibits the 
proliferation of said first microorganism. The inhibitory nucleic acid may comprise a 
portion of an antisense nucleic acid which inhibits the proliferation of said first 
25 microorganism. The inhibitory nucleic acid may comprise an antisense molecule 

against the entire coding region of the gene involved in proliferation of the first 
microorganism. The inhibitory nucleic acid may comprise an antisense nucleic acid to a 
portion of the operon encoding the gene involved in proliferation of the first 
microorganism^ 

50 Another embodiment of the present invention is a compound identified using the 

method above. 
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Another embodiment of the present invention is a method for assaying 
compounds for activity against a biological pathway required for proliferation 
comprising: 

sensitizing a cell by expressing an antisense nucleic acid against a nucleic acid 
encoding a gene product required for proliferation in a cell to reduce the activity or 
amount of said gene product; 

contacting the sensitized cell with a compound; and 

determining whether said compound inhibits the growth of said sensitized cell to 
a greater extent than said compound inhibits the growth of an nonsensitized cell. 

The cell may be selected from the group consisting of bacterial cells, fungal 
cells, plant cells, and animal cells. The cell may be an E. coli cell. The cell may be an 
organism selected from the group consisting of Staphylococcus aureus, Pseudomonas 
aeruginosa, Enterobacter cloacae, Helicobacter pylori. Neisseria gonorrhoeae, 
Enterococcus faecalis, Streptococcus pneumoniae, Haemophilus influenzae. Salmonella 
typhimurium, Saccharomyces cerevisiae, Candida albicans, Cryptococcus neoformans, 
Aspergillus fumigatus, Klebsiella pneumoniae, Salmonella typhi, Salmonella paratyphi. 
Salmonella cholerasuis, Staphylococcus epidermidis, Mycobacterium tuberculosis, 
Mycobacterium leprae, Treponema pallidum, Bacillus anthracis. Yersinia pestis, 
Clostridium botulinum, Campylobacter jejuni, and Chlamydia trachomatus. Chlamydia 
pneumoniae or any species falling within the genera of any of the above species. The 
antisense nucleic acid may be transcribed from an inducible promoter. The method may 
fiirther comprise contacting the cell with an agent which induces expression of said 
antisense nucleic acid from said inducible promoter, wherein said antisense nucleic acid 
is expressed at a sublethal level. The sublethal level of said antisense nucleic acid may 
inhibit proliferation by 8% or more. The agent may be isopropyl-l-thio-P-D- 
galactoside (IPTG). The inhibition of proliferation may be measured by monitoring the 
optical density of a liquid culture. The gene product may comprise a polypeptide 
having a sequence selected from the group consisting of SEQ ID NOs: 243-357, 359- 
398. 

Another embodiment of the present invention is a compound identified using the 
method above. 
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Another embodiment of the present invention is a method for assaying a 
compound for the ability to inhibit cellular proliferation comprising: 

contacting a cell with an agent which reduces the activity or level of a gene 
product required for proliferation of said cell; 
5 contacting said cell with said compoxmd; and 

determining whether said compound reduces proliferation to a greater extent 
than said compound reduces proliferation of cells which have not been contacted with 
said agent. 

The agent which reduces the activity or level of a gene product required for 
10 proliferation of said cell may comprise an antisense nucleic acid to a gene or operon 
required for proliferation. The agent which reduces the activity or level of a gene 
product requu-ed for proliferation of said cell may comprise an antibiotic. The cell may 
contain a temperature sensitive mutation which reduces the activity or level of said gene 
product required for proliferation of said cell. The antisense nucleic acid may be 
15 directed against the same functional domain of said gene product required for 

proliferation of said cell to which said antisense nucleic acid is directed. The antisense 
nucleic acid may be directed against a different functional domain of said gene product 
required for proliferation of said cell than the fucntional domain to which said antisense 
nucleic acid is directed. 

2^ Another embodiment of the present invention is a compound identified using the 

method above. 

Another embodiment of the present invention is a method for identifying the 
pathway in which a proliferation-required nucleic acid or its gene product lies 
comprising: 

25 expressing a sublethal level of an antisense nucleic acid directed against said 

proliferation-required nucleic acid in a cell; 

contacting said cell with an antibiotic, wherein the a biological pathway on 
which said antibiotic acts is known; and 

determining whether said cell has a substantially greater sensitivity to said 
30 antibiotic than a cell which does not express said sublethal level of said antisense 
nucleic acid. 
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Another embodiment of the present invention is a method for determining the 
pathway on which a test compound acts comprising: 

(a) expressing a sublethal level of an antisense nucleic acid directed against a 
proliferation-required nucleic acid in a cell, wherein the biological pathway in which 

5 said proliferation-required nucleic acid lies is known, 

(b) contacting said cell with said test compound; and 

(c) determining whether said cell has a substantially greater sensitivity to said 
test compound than a cell which does not express said sublethal level of said antisense 
nucleic acid. 

1 0 The method may further comprise: 

(d) expressing a sublethal level of a second antisense nucleic acid directed against a 
second proliferation-required nucleic acid in said cell, wherein said second 
proliferation-required nucleic acid is in a different biological pathway than said 
proliferation-required nucleic acid in step (a); and 

15 (e) determining whether said cell has a substantially greater sensitivity to said test 
compound than a cell which does not express said sublethal level of said second 
antisense nucleic acid. 

Another embodiment of the present invention is a purified or isolated nucleic 
acid consisting essentially of one of SEQ ID NOs: 358, 399-402. 
20 Another embodiment of the present invention is a purified or isolated nucleic 

acid comprising a sequence selected from the group consisting of 1-81, 405-485, 82-88, 
90-242, 358, 399-402. 

Another embodiment of the present invention is a compound which interacts 
with the gene or gene product of a nucleic acid comprising a sequence of one of SEQ 
25 ID NOs: 82-88, 90-242 to inhibit proliferation. 

Another embodiment of the present invention compound which interacts with a 
polypeptide comprising one of SEQ ID NOs. 243-357, 359-398 to inhibit proliferation. 

Another embodiment of the present invention is a compound which interacts 
with a nucleic acid comprising one of SEQ ID NOs: 358, 399-402 to inhibit 
30 proliferation. 

BRIEF DESCRIPTION OF THE DRAWINGS 
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Figure 1 is an IPTG dose response curve in E. coli transformed with an 
IPTG-inducible plasmid containing either an antisense clone to the E. coli ribosomal 
protein rplW (AS-rplW) which is required for protein synthesis and essential cell 
proliferation, or an antisense clone to the elaD (AS-elaD) gene which is not known to be 
5 involved in protein synthesis and which is also essential for proliferation. 

Figure 2A is a tetracycline dose response curve in E. coli transformed 
with an IPTG-inducible plasmid containing antisense to rplW(AS-rplW) in the 
presence of 0, 20 or 50 \iM IPTG. 

Figure 2B is a tetracycline dose response curve in E. coli transformed 
10 v^dth an IPTG-inducible plasmid containing antisense to elaD (AS-elaD) in the presence 
ofO, 20 or 50 ^iM IPTG. 

Figure 3 is a graph showing the fold increase in tetracycline sensitivity of 
E. coli transfected with antisense clones to essential ribosomal proteins L23 (AS-rplW) 
and L7/L12 and LIO (AS-rplLrplJ). Antisense clones to genes known not to be 
15 involved in protein synthesis (atpB/E(AS-atpB/E ), visC (AS-visC, elaD (AS-elaD), 

yohH (AS-yohH) are much less sensitive to tetracycline. 

Definitions 

By "biological pathway" is meant any discrete cell function or process that is 
carried out by a gene product or a subset of gene products. Biological pathways include 

20 enzymatic, biochemical and metabolic pathways as well as pathways involved in the 
production of cellular structures such cell walls. Biological pathways that are usually 
required for proliferation of microorganisms include, but are not limited to, cell 
division, DNA synthesis & replication, RNA synthesis (transcription), protein synthesis 
(translation), protein processing, protein transport, fatty acid biosynthesis, cell wall 

25 synthesis, cell membrane synthesis & maintenance, etc. 

By "inhibit activity against a gene or gene product" is meant having the ability 
to interfere with the function of a gene or gene product in such a way as to decrease 
expression of the gene or to reduce the level or activity of a product of the gene. Agents 
which have activity against a gene include agents that inhibit transcription of the gene 

30 and agents that inhibit translation of the mRNA transcribed from the gene. In 
microorganisms, agents which have activity against a gene can act to decrease 
expression of the operon in which the gene resides or alter the processing of operon 
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RNA such as to reduce the level or activity of the gene product. The gene product can 
be a non-translated RNA such as ribosomal RNA, a translated RNA (mRNA) or the 
protein product resuhing from translation of the gene mRNA. Of particular utility to 
the present invention are anti-sense RNAs that have activities against the operons or 

5 genes to which they specifically hybridze. 

By "activity against a gene product" is meant having the ability to inhibit the 
function or to reduce the level or activity of the gene product in a cell. 

By "activity against a protein" is meant having the ability to inhibit the function 
or to reduce the level or activity of the protein in a cell. 

0 By "activity against nucleic acid" is meant having the ability to inhibit the 

function or to reduce the level or activity of the nucleic acid in a cell. 

As used herein, "sublethal" means a concentration of an agent below the 
concentration required to inhibit all cell growth. 

DETAILED DESCRIPTION OF THE INVENTION 

5 The present invention describes a group of E. coli genes and gene families required 

for growth and/or proliferation. A proliferation-required gene or gene family is one where, 
in the absence of a gene transcript and/or gene product, growth or viability of the 
microorganism is reduced or eliminated. Thus, as used herein the terminology 
"proliferation-required" or "required for proliferation" encompasses sequences where the 

0 absence of a gene transcript and/or gene product completely eliminates cell growth as well 
as sequences where the absence of a gene transcript and/or gene product merely reduces 
cell growth. These proliferation-required genes can be used as potential targets for the 
generation of new antimicrobial agents. To achieve that goal, the present invention also 
encompasses novel assays for analyzing proliferation-required genes and for identifying 

5 compounds which interact with the gene products of the proliferation-required genes. In 
addition, the present invention contemplates the expression of genes and the purification of 
the proteins encoded by the nucleic acid sequences identified as required prohferation 
genes and reported herein. The purified proteins can be used to generate reagents and 
screen small molecule libraries or other candidate compound libraries for compounds that 

) can be further developed to yield novel antimicrobial compounds. The present invention 
also describes methods for identification of homologous genes in organisms other tiian E. 
coli. 
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The present invention utilizes a novel method to identify proliferation-required E, 
coll sequences. Generally, a library of nucleic acid sequences from a given source are 
subcloned or otherwise inserted into an inducible expression vector, thus forming an 
expression library. Although the insert nucleic acids may be derived from the 
chromosome of the organism into which the expression vector is to be introduced, because 
the insert is not in its natural chromosomal location, the insert nucleic acid is an exogenous 
nucleic acid for the purposes of the discussion herein. The term expression is defined as 
the production of an RNA molecule from a gene, gene fi-agment, genomic fragment, or 
operon. Expression can also be used to refer to the process of peptide or polypeptide 
synthesis. An expression vector is defined as a vehicle by which a ribonucleic acid (RNA) 
sequence is transcribed from a nucleic acid sequence carried within the expression vehicle. 
The expression vector can also contain features that permit translation of a protein product 
from the transcribed RNA message expressed from the exogenous nucleic acid sequence 
carried by the expression vector. Accordingly, an expression vector can produce an RNA 
molecule as its sole product or the expression vector can produce a RNA molecule that is 
ultimately translated into a protein product. 

Once generated, the expression library containing the exogenous nucleic acid 
sequences is introduced into an E. coli population to search for genes that are required for 
bacterial proUferation. Because the library molecules are foreign to the population of E, 
coli, the expression vectors and the nucleic acid segments contained therein are considered 
exogenous nucleic acid. 

Expression of the exogenous nucleic acid fragments in the test population of E. coli 
containing the expression vector library is then activated. Activation of the expression 
vectors consists of subjecting the cells containing the vectors to conditions that resuU in the 
expression of the exogenous nucleic acid sequences carried by the expression vector 
library. The test population of £ coli cells is then assayed to determine the effect of 
expressing the exogenous nucleic acid fragments on the test population of cells. Those 
expression vectors that, upon activation and expression, negatively impact the growth of 
the E. coli screen population were identified, isolated, and purified for further study. 

A variety of assays are contemplated to identify nucleic acid sequences that 
negatively impact growtii upon expression. In one embodiment, growth in E, coli cultures 
expressing exogenous nucleic acid sequences and growth in cultures not expressing tiiese 
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sequences is compared. Growth measurements are assayed by examining the extent of 
growth by measuring optical densities. Alternatively, enzymatic assays can be used to 
measure bacterial growth rates to identify exogenous nucleic acid sequences of interest. 
Colony size, colony morphology, and cell morphology are additional factors used to 
evaluate growth of the host cells. Those cultures that failed to grow or grow with reduced 
efficiency under expression conditions are identified as containing an expression vector 
encoding a nucleic acid fragment that negatively affects a proliferation-required gene. 

Once exogenous nucleic acid sequences of interest are identified, they are 
analyzed. The first step of the analysis is to acquire the nucleic acid sequence of the 
nucleic acid fi:agment of interest. To achieve this end, the insert in those expression 
vectors identified as containing a sequence of interest is sequenced, using standard 
techniques well known in the art The next step of the process is to determine the source 
of the nucleic acid sequence. 

Determination of sequence source is achieved by comparing the obtained sequence 
data with known sequences in various genetic databases. The sequences identified are 
used to probe these gene databases. The result of this procedure is a list of exogenous 
nucleic acid sequences corresponding to a list that includeds novel bacterial genes required 
for proliferation as well as genes previously identified as required for proliferation. 

The number of DNA and protein sequences available in database systems has been 
growing exponentially for years. For example, at the end of 1998, the complete sequences 
of Caenorhabditis elegans, Saccharomyces cerevisiae and nineteen bacterial genomes, 
including E. coli were available. This sequence information is stored in a number of 
databanks, such as GenBank (the National Center for Biotechnology Information (NCBI), 
and is publicly available for searching. 

A variety of computer programs are available to assist in the analysis of the 
sequences stored within these databases. FastA, (W. R. Pearson (1990) "Rapid and 
Sensitive Sequence Comparison with FASTP and FASTA" Methods in Enzymology 
183:63- 98), Sequence Retrieval System (SRS), (Etzold & Argos, SRS an indexing and 
retrieval tool for flat file data libraries. Comput. Appl. Biosci. 9:49-57, 1993) are two 
examples of computer programs that can be used to analyze sequences of interest. In 
one embodiment of the present invention, the BLAST family of computer programs, 
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which includes BLASTN version 2,0 with the default parameters, or BLASTX version 
2.0 with the default parameters, is used to analyze nucleic acid sequences. 

BLAST, an acronym for "Basic Local Alignment Search Tool " is a family of 
programs for database similarity searching. The BLAST family of programs includes: 
5 BLASTN, a nucleotide sequence database searching program, BLASTX, a protein 

database searching program where the input is a nucleic acid sequence; and BLASTP, a 
protein database searching program. BLAST programs embody a fast algorithm for 
sequence matching, rigorous statistical methods for judging the significance of matches, 
and various options for tailoring the program for special situations. Assistance in using the 

1 0 program can be obtained by e-mail at blast@ncbi.nlm.nih. gov. 

Bacterial genes are often transcribed in polycistronic groups. These groups 
comprise operons, which are a collection of genes and intergenic sequences. The genes of 
an operon are co-transcribed and are often related fiznctionally. Given the nature of the 
screening protocol, it is possible that the identified exogenous nucleic acid sequence 

1 5 corresponds to a gene or portion thereof with or without adjacent noncoding sequences, an 

intragenic sequence (i.e. a sequence within a gene), an intergenic sequence (i.e. a sequence 
between genes), a sequence spanning at least a portion of two or more genes, a 5' 
noncoding region or a 3' noncoding region located upstream or downstream fi-om the 
actual sequence that is required for bacterial proliferation. Accordingly, determining 

20 which of the genes that are encoded within the operons are individually required for 

proliferation is often desirable. 

In one embodiment of the present invention, an operon is dissected to determine 
which gene or genes are required for proliferation. For example, the RegulonDB 
DataBase described by Huerta et al. {Nucl Acids Res. 26:55-59, 1998), which may also be 

25 found on the website http://www.cifn.unam.mx/Computational_Biology/regulondb/, may 

be used, to identify the boundaries of operons encoded within microbial genomes. A 
number of techniques that are well known in the art can be used to dissect the operon. In 
one aspect of this embodiment, gene disruption by homologous recombination is used to 
individually inactivate the genes of an operon that is thought to contain a gene required for 

30 proliferation. 

Several gene disruption techniques have been described for the replacement of a 
ftmctional gene with a mutated, non-fimctional (null) allele. These techniques generally 
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involve the use of homologous recombination. The method described by Link et al. (J. 
Bacteriol 1997 179:6228; incorporated herein by reference in it's entirety) serves as an 
excellent example of these methods as applicable to disruption of genes in E. coli. This 
technique uses crossover PGR to create a null allele with an in-frame deletion of the 
coding region of a target gene. The null allele is constructed in such a way that 
sequences adjacent to the wild type gene (ca. 500 bp) are retained. These homologous 
sequences surrounding the deletion null allele provide targets for homologous 
recombination so that the wild type gene on the E. coli chromosome can be replaced by 
the constructed null allele. 

The crossover PGR amplification product is subcloned into the vector pK03, the 
features of which include a chloramphenicol resistance gene, the counter-selectable 
marker sacB, and a temperature sensitive autonomous replication function. Following 
transformation of an E. coli cell population with such a vector, selection for cells that 
have undergone homologous recombination of the vector into the chromosome is 
achieved by growth on chloramphenicol at the non-permissive temperature of 43''G, 
Under these conditions, autonomous replication of the plasmid cannot occur and cell are 
resistant to chloramphinicol only if the chloramphenicol resistance gene has been 
integrated into the chromosome. Usually a single crossover event is responsible for 
this integration event such that the E. coli chromosome now contains a tandem 
duplication of the target gene consisting of one wild type allele and one deletion null 
allele separated by vector sequence. 

This new E, coli strain containing the tandem duplication can be maintained at 
permissive temperatures in the presence of drug selection (chloramphenicol). 
Subsequently, cells of this new strain are cultured at the permissive temperature 30°G 
without drug selection. Under these conditions, the chromosome of some of the cells 
within the population will have undergone an internal homologous recombination event 
resulting in removal of the plasmid sequences. Subsequent culturing of the strain in 
growth medium lacking chloramphenicol but containing sucrose is used to select for 
such recombinative resolutions. In the presence of the coimter-selectable marker sacB, 
sucrose is rendered into a toxic metabolite. Thus, cells that survive this counter- 
selection have lost both the plasmid sequences from the chromosome and the 
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autonomously replicating plasmid that results as a byproduct of recombinative 
resolution. 

There are two possible outcomes of the above recombinative resolution via 
homologous recombination. Either the wild type copy of the targeted gene is retained 
on the chromosome or the mutated null allele is retained on the chromosome. In the 
case of an essential gene, a single copy of the null allele would be lethal and such cells 
should not be obtained by the above procedure when applied to essential genes. In the 
case of a non-essential gene, roughly equal numbers of cells containing null alleles and 
cells containing wild type alleles should be obtained. Thus, the method serves as a test 
for essentiality of the targeted gene: when applied to essential genes, only cells with a 
wild type allele on the chromosome will be obtained. 

Other techniques have also been described for the creation of disruption 
mutations in E, coli. For example. Link et aL also describe inserting an in-frame 
sequence tag concommitantly with an in-frame deletion in order to simplify analysis of 
recombinants obtained. Further, Link et al. describe disruption of genes with a drug 
resistance marker such as a kanamycin resistance gene. Arigoni et al, (Arigoni, F. et al. 
A Genome-based Approach for the Identification of Essential Bacterial Genes, Nature 
Biotechnology 16: 851-856, the disclosure of which is incorporated herein by reference 
in its entirety) describe the use of gene disruption combined with engineering a second 
copy of a test gene such that the expression of the gene is regulated by and inducible 
promoter such as the arabinose promoter to test the essentiality of the gene. Many of 
these techniques result in the insertion of large fragments of DNA into the gene of 
interest, such as a drug selection marker. An advantage of the technique described by 
Link et al. is that it does not rely on an insertion into the gene to cause a functional 
defect, but rather resuhs in the precise removal of the coding region. This insures the 
lack of polar effects on the expression of genes dovmstream from the target gene. 

Recombinant DNA techniques can be used to express the entire coding sequences 
of the gene identified as required for proliferation, or portions thereof The over-expressed 
proteins can be used as reagents for further study. The identified exogenous sequences are 
isolated, purified, and cloned into a suitable expression vector using methods well known 
in the art. If desired, the nucleic acids can contain the sequences encoding a signal peptide 
to facilitate secretion of the expressed protein. 
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Expression of fragments of the bacterial genes identified as required for 
proliferation is also contemplated by the present invention. The fragments of the identified 
genes can encode a polypeptide comprising at least 5, at least 10, at least 15, at least 20, at 
least 25, at Ipast 30, at least 35, at least 40, at least 45, at least 50, at least 55, at least 60, at 
5 least 65, at least 75, or more than 75 consecutive amino acids of a gene complementary to 
one of the identified sequences of the present invention. The nucleic acids inserted into the 
expression vectors can also contain sequences upstream and downstream of the coding 
sequence. 

When expressing the coding sequence of an entire gene identified as required for 

10 bacterial proliferation or a fragment thereof, the nucleic acid sequence to be expressed is 
operably linked to a promoter in an expression vector using conventional cloning 
technology. The expression vector can be any of the bacterial, insect, yeast, or mammalian 
expression systems known in the art. Commercially available vectors and expression 
systems are available from a variety of suppliers including Genetics Institute (Cambridge, 

15 MA), Stratagene (La JoUa, California), Promega (Madison, Wisconsin), and Invitrogen 

(San Diego, California). If desired, to enhance expression and facilitate proper protein 
folding, the codon usage and codon bias of the sequence can be optimized for the 
particular expression organism in which the expression vector is introduced, as explained 
by Hatfield, et al., U.S. Patent No. 5,082,767, incorporated herein by tliis reference. 

20 Fusion protein expression systems are also contemplated by the present invention. 

Following expression of the protein encoded by the identified exogenous nucleic 
acid sequence, the protein is purified. Protein purification techniques are well known in 
the art. Proteins encoded and expressed from identified exogenous nucleic acid sequences 
can be partially purified using precipitation techniques, such as precipitation with 

25 polyethylene glycol. Chromatographic methods usable with the present invention can 
include ion-exchange chromatography, gel filtration, use of hydroxyapaptite columns, 
immobilized reactive dyes, chromatofocusing, and use of high-performance liquid 
chromatography. Electrophoretic methods such one-dimensional gel electrophoresis, high- 
resolution two-dimensional polyacrylamide electrophoresis, isoelectric focusing, and 

30 others are contemplated as purification methods. Also, affinity chromatographic methods, 
comprising antibody columns, ligand presenting columns and other affinity 
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chromatographic matrices are contemplated as purification methods in the present 
invention. 

The purified proteins produced fi'om the gene coding sequences identified as 
required for proUferation can be used in a variety of protocols to generate useful 
5 antimicrobial reagents. In one embodiment of the present invention, antibodies are 
generated against the proteins expressed fi:om the identified exogenous nucleic acid 
sequences. Both monoclonal and polyclonal antibodies can be generated against the 
expressed proteins. Methods for generating monoclonal and polyclonal antibodies are well 
known in the art. Also, antibody fragment preparations prepared from the produced 

1 0 antibodies discussed above are contemplated. 

Another application for the purified proteins of the present invention is to screen 
small molecule libraries for candidate compounds active against the various target proteins 
of the present invention. Advances in the field of combinatorial chemistry provide 
methods, well known in the art, to produce large numbers of candidate compounds that can 

15 have a binding, or otherwise inhibitory effect on a target protein. Accordingly, the 
screening of small molecule libraries for compounds with binding affinity or inhibitory 
activity for a target protein produced from an identified gene sequence is contemplated by 
the present invention. 

The present invention fiorther contemplates utility against a variety of other 

20 patiiogenic organisms in addition to E. coll For example, the invention has utility in 
identifying genes required for proliferation in prokaryotes and eukaryotes. For example, 
tiie invention has utility with protists, such as Plasmodium spp.; plants; animals, such as 
Entamoeba spp. and Contracaecum spp; and fungi including Candida spp., (e.g., 
Candida albicans), Saccharomyces cerevisiae, Cryptococcus neoformans, and Aspergillus 

15 jumigatus. In one embodiment of the present invention, monera, specifically bacteria are 
probed in search of novel gene sequences required for proUferation. This embodiment is 
particularly important given the rise of drug resistant bacteria. 

The numbers of bacterial species that are becoming resistant to existing antibiotics 
are growing. A partial list of these organisms includes: Staphylococcus spp., such as S. 

50 aureus; Enterococcus spp., such as E. faecalis; Pseudomonas spp., such as P. 
aeruginosa, Clostridium spp., such as C botulinum, Haemophilus spp., such as H. 
influenzae, Enterobacter spp., such as E. cloacae, Vibrio spp., such as V. cholera; 



-23- 



Moraxala spp., such as M catarrhalis; Streptococcus spp., such as S. pneumoniae, 
Neisseria spp., such as A^. gonorrhoeae; Mycoplasma spp., such as Mycoplasma 
pneumoniae; Salmonella typhimurium; Helicobacter pylori; Escherichia coli; and 
Mycobacterium tuberculosis. The sequences identified as required for proliferation in 
5 the present invention can be used to probe these and other organisms to identify 

homologous required proliferation genes contained therein. 

In one embodiment of the present invention, the nucleic acid sequences disclosed 
herein are used to screen genomic libraries generated from bacterial species of interest 
other than E, coli. For example, the genomic library may be from Staphylococcus aureus, 

10 Pseudomonas aeruginosa, Enterobacter cloacae, Helicobacter pylori, Neisseria 

gonorrhoeae, Enterococcus faecalis, Streptococcus pneumoniae, Haemophilus 
influenzae, Salmonella typhimurium, Saccharomyces cerevisiae, Candida albicans, 
Cryptococcus neoformans, Aspergillus fumigatus, Klebsiella pneumoniae, Salmonella 
typhi, Salmonella paratyphi, Salmonella cholerasuis, Staphylococcus epidermidis, 

15 Mycobacterium tuberculosis, Mycobacterium leprae, Treponema pallidum, Bacillus 

anthracis, Yersinia pestis, Clostridium botulinum, Campylobacter jejuni, Chlamydia 
trachomatus, Chlamydia pneumoniae or any species falling within the genera of any of 
the above species. Standard molecular biology techniques are used to generate genomic 
libraries from various microorganisms. In one aspect, the libraries are generated and 

20 boimd to nitrocellulose paper. The identified exogenous nucleic acid sequences of the 
present invention can then be used as probes to screen the libraries for homologous 
sequences. The homologous sequences identified can then be used as targets for the 
identification of nev^, antimicrobial compounds with activity against more than one 
organism. 

25 For example, the preceding methods may be used to isolate nucleic acids having 

a sequence with at least 97%, at least 95%, at least 90%, at least 85%>, at least 80%, or at 
least 70% identity to a nucleic acid sequence selected from the group consisting of one 
of the sequences of SEQ ID NOS. 1-81, 405-485, 82-88, 90-242, fragments comprising 
at least 10, 15, 20, 25, 30, 35, 40, 50, 75, 100, 150, 200, 300, 400, or 500 consecutive 

30 bases thereof, and the sequences complementary thereto. Identity may be measured 
using BLASTN version 2.0 with the default parameters, (Altschul, S.F. et al. Gapped 
BLAST and PSI-BLAST: A New Generation of Protein Database Search Programs, 
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Nucleic Acid Res. 25: 3389-3402 (1997), the disclosure of which is incorporated herein 
by reference in its entirety). For example, the homologous polynucleotides may have a 
coding sequence which is a naturally occurring allelic variant of one of the coding 
sequences described herein. Such allelic variants may have a substitution, deletion or 
5 addition of one or more nucleotides when compared to the nucleic acids of SEQ ID 

NOs: 1-81, 405-485, 82-88, 90-242 or the sequences complementary thereto. 

Additionally, the above procedures may be used to isolate nucleic acids which 
encode polypeptides having at least 99%, 95%, at least 90%, at least 85%, at least 80%, 
at least 70%, at least 60%, at least 50%, or at least 40% identity or similarity to a 

10 polypeptide having the sequence of one of SEQ ID NOs: 243-357, 359-398or fragments 
comprising at least 5, 10, 15, 20, 25, 30, 35, 40, 50, 75, 100, or 150 consecutive amino 
acids thereof as determined using the FASTA version 3,0t78 algorithm with the default 
parameters. Alternatively, protein identity or similarity may be identified using 
BLASTP with the default parameters, BLASTX with the default parameters, or 

15 TBLASTN with the default parameters. (Alschul, S.F. et al Gapped BLAST and PSI- 

BLAST: A New Generation of Protein Database Search Programs, Nucleic Acid Res. 
25: 3389-3402 (1997), the disclosure of which is incorporated herein by reference in its 
entirety). 

Alternatively, homologous nucleic acids or polypeptides may be identified by 
20 searching a database to identify sequences having a desired level of homology to a 

nucleic acid or polypeptide involved in proliferation or an antisense nucleic acid to a 
nucleic acid involved in microbial proliferation. A variety of such databases are 
available to those skilled in the art, including GenBank and GenSeq. In some 
embodiments, the databases are screened to identify nucleic acids or polypeptides 
25 having at least 97%>, at least 95%, at least 90%, at least 85%>, at least 80%, at least 70%, 

at least 60%, or at least 50%, at least 40% identity or similarity to a nucleic acid or 
polypeptide involved in proliferation or an antisense nucleic acid involved in 
proliferation. For example, the database may be screened to identify nucleic acids 
homologous to one of SEQ ID Nos. 1-81, 405-485, 82-88, 90-242 or polypeptides 
30 homologous to SEQ ID NOs. 243-357, 359-398. In some embodiments, the database 
may be screened to identify homologous nucleic acids or polypeptides from organisms 
other than E. colU including organisms such as Staphylococcus aureus, Pseudomonas 
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aeruginosa, Enterobacter cloacae, Helicobacter pylori, Neisseria gonorrhoeae, 
Enterococcus faecalis, Streptococcus pneumoniae, Haemophilus influenzae, Salmonella 
typhimurium, Saccharomyces cerevisiae, Candida albicans, Cryptococcus neoformans, 
Aspergillus fumigatus, Klebsiella pneumoniae, Salmonella typhi. Salmonella paratyphi, 
Salmonella cholerasuis, Staphylococcus epidermidis, Mycobacterium tuberculosis, 
Mycobacterium leprae, Treponema pallidum. Bacillus anthracis. Yersinia pestis, 
Clostridium botulinum, Campylobacter jejuni,Chlamydia trachomatus, Chlamydia 
pneumoniae or any species falling within the genera of any of the above species. 

In another embodiment, gene expression arrays and microarrays can be 
employed. Gene expression arrays are high density arrays of DNA samples deposited at 
specific locations on a glass chip, nylon membrane, or the like. Such arrays can be used 
by researchers to quantify relative gene expression under different conditions. Gene 
expression arrays are used by researchers to help identify optimal drug targets, profile 
new compounds, and determine disease pathways. An example of this technology is 
found in U.S. Patent No. 5807522, which is hereby incorporated by reference. 

It is possible to study the expression of all genes in the genome of a particular 
microbial organism using a single array. For example, the arrays from Genosys consist 
of 12 X 24 cm nylon filters containing PGR products corresponding to 4290 ORFs from 
£. coli. 10 ngs of each are spotted every 1.5 mm on the filter. Single stranded labeled 
cDNAs are prepared for hybridization to the array (no second strand synthesis or 
amplification step is done) and placed in contact with the filter. Thus the labeled 
cDNAs are of "antisense" orientation. Quantitative analysis is done by 
phosphorimager. 

Hybridization of cDNA made fi-om a sample of total cell mRNA to such an 
array followed by detection of binding by one or more of various techniques known to 
those in the art results in a signal at each location on the array to which cDNA 
hybridized. The intensity of the hybridization signal obtained at each location in the 
array thus reflects the amount of mRNA for that specific gene that was present in the 
sample. Comparing the results obtained for mRNA isolated from cells grown under 
different conditions thus allows for a comparison of the relative amount of expression 
of each individual gene during growth under the different conditions. 
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Gene expression arrays may be used to analyze the total mRNA expression 
pattern at various time points after induction of an antisense nucleic acid against a 
proliferation-required gene. Analysis of the expression pattern indicated by 
hybridization to the array provides information on whether or not the target gene of the 
antisense nucleic acid is being affected by antisense induction, how quickly the 
antisense is affecting the target gene, and for later timepoints, what other genes are 
affected by antisense expression. For example, if the antisense is directed against a 
gene for ribosomal protein L7/L12 in the SOS subunit, its targeted mRNA may 
disappear first and then other mRNAs may be observed to increase, decrease or stay the 
same. Similarly, if the antisense is directed against a different SOS subunit ribosomal 
protein mRNA (e.g, L25), that mRNA may disappear first followed by changes in 
mRNA expression that are similar to those seen with the L7/L12 antisense expression. 
Thus, the mRNA expression pattern observed with an antinsense nucleic acid against a 
proliferation required gene may identify other proliferation-required nucleic acids in the 
same pathway as the target of the antisense nucleic acid. In addition, the mRNA 
expression patterns observed with candidate drug compounds may be compared to those 
observed with antisense nucleic acids against a proliferation-required nucleic acid. If 
the mRNA expression pattern observed with the candidate drug compound is similar to 
that observed with the antisense nucleic acid, the drug compound may be a promising 
therapeutic candidate. Thus, the assay would be useful in assisting in the selection of 
candidate drug compounds for use in screening methods such as those described below. 

In cases where the source of nucleic acid deposited on the array and the source 
of the nucleic acid being hybridized to the array are from two different organisms, gene 
expression arrays can identify homologous genes in the two organisms. 

The present invention also contemplates additional methods for screening other 
microorganisms for proliferation-required genes. In this embodiment, the conserved 
portions of sequences identified as proliferation-required can be used to generate 
degenerate primers for use in the polymerase chain reaction (PGR). The PGR technique is 
well known in the art. The successful production of a PGR product using degenerate 
probes generated firom the sequences identified herein would indicate the presence of a 
homologous gene sequence in the species being screened. This homologous gene is then 
isolated, expressed, and used as a target for candidate antibiotic compounds. In another 
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aspect of this embodiment, the homologous gene is expressed in an autologous organism 
or in a heterologous organism in such a way as to alter the level or activity of a 
homologous gene required for proliferation in the autologous or heterologus organism. In 
still another aspect of this embodiment, the homologous gene or portion is expressed in an 
5 antisense orientation in such a way as to alter the level or activity of a nucleic acid required 

for proUferation of an autologous or heterologous organism. 

The homologous sequences to proliferation-required genes identified using the 
techniques described herein may be used to identify proliferation-required genes of 
organisms other than E, coli, to inhibit the proliferation of organisms other than E. coli 

10 by inhibiting the activity or reducing the amount of the identified homologous nucleic 
acid or polypeptide in the organism other than E. coli, or to identify compounds which 
inhibit the growth of organisms other than E. coli as described below. 

In another embodiment of the present invention, E. coli sequences identified as 
required for proliferation are transferred to expression vectors capable of function within 

15 noU'E coli species. As would be appreciated by one of ordinary skill in the art, 

expression vectors must contain certain elements that are species specific. These 
elements can include promoter sequences, operator sequences, repressor genes, origins 
of replication, ribosomal binding sequences, termination sequences, and others. To use 
the identified exogenous sequences of the present invention, one of ordinary skill in the 

20 art would know to use standard molecular biology techniques to isolate vectors 
containing the sequences of interest from cultured bacterial cells, isolate and purify 
those sequences, and subclone those sequences into an expression vector adapted for 
use in the species of bacteria to be screened. 

Expression vectors for a variety of other species are known in the art. For 

25 example, Cao et al. report the expression of steroid receptor fragments in 

Staphylococcus aureus, i. Steroid Biochem Mol BioL 44(1):1-1 1 (1993). Also, Pla et 
al. have reported an expression vector that is functional in a number of relevant hosts 
including: Salmonella typhimurium, Pseudomonas putida, and Pseudomonas 
aeruginosa. 3. BacterioL 172(8):4448-55 (1990). These examples demonstrate the 

30 existence of molecular biology techniques capable of constructing expression vectors 
for the species of bacteria of interest to the present invention. 
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Following the subcloning of the identified nucleic acid sequences into an 
expression vector functional in the microorganism of interest, the identified nucleic acid 
sequences are conditionally transcribed to assay for bacterial growth inhibition. Those 
expression vectors found to contain sequences that, when transcribed, inhibit bacterial 
5 growth are compared to the known genomic sequence of the pathogenic microorganism 

being screened or, if the homologous sequence from the organism being screened is not 
known, it may be identified and isolated by hybridization to the proliferation-required 
E. coll sequence interest or by ampHfication using primers based on the proliferation- 
required E. coli sequence of interest as described above. 

10 The antisense sequences from the second organism which are identified as 

described above may then be operably linked to a promoter, such as an inducible 
promoter, and introduced into the second organism. The techniques described herein 
for identifying E. coli genes required for proliferation may thus be employed to determine 
whether the identified sequences fi:om a second organism inhibit the proliferation of the 

1 5 second organism. 

Antisense nucleic acids required for the proUferation of organisms other than E, 
coli or the genes corresponding thereto, may also be hybridized to a microarray 
containing the E, coli ORFs to gauge the homology between the E. coli sequences and 
the proliferation-required nucleic acids from other organisms. For example, the 

20 proliferation-required nucleic acid may be from Staphylococcus aureus, Pseudomonas 

aeruginosa, Enterobacter cloacae, Helicobacter pylori, Neisseria gonorrhoeae, 
Enterococcus faecalis, Streptococcus pneumoniae, Haemophilus influenzae, Salmonella 
typhimurium, Saccharomyces cerevisiae, Candida albicans, Cryptococcus neoformans, 
Aspergillus fumigatus, Klebsiella pneumoniae. Salmonella typhi, Salmonella paratyphi, 

25 Salmonella cholerasuis, Staphylococcus epidermidis, Mycobacterium tuberculosis, 

Mycobacterium leprae, Treponema pallidum, bacillus anthracis, Yersinia pestis, 
Clostridium botulinum, Campylobacter jejuni or Chlamydia trachomatus. Chlamydia 
pneumoniae or any species falling within the genera of any of the above species. The 
proliferation-required nucleic acids from an organism other than E. coli may be 

30 hybridized to the array under a variety of conditions which permit hybridization to 

occur when the probe has different levels of homology to the sequence on the 



-29- 



microarray. This would provide an indication of homology across the organisms as 
well as clues to other possible essential genes in these organisms. 

In still another embodiment, the exogenous nucleic acid sequences of the present 
invention that are identified as required for bacterial growth or proUferation can be used as 
5 antisense therapeutics for killing bacteria. The antisense sequences can be directed against 

the proliferation-required genes whose sequence corresponds to the exogenous nucleic 
acid probes identified here (i.e. the antisense nucleic acid may hybridize to the gene or a 
portion thereof). Alternatively, antisense therapeutics can be directed against operons in 
which proliferation-required genes reside (i.e. the antisense nucleic acid may hybridize to 
10 any gene in the operon in which the proliferation-required genes reside). Further, 
antisense therapeutics can be directed against a proliferation-required gene or portion 
thereof with or without adjacent noncoding sequences, an intragenic sequence (i.e. a 
sequence within a gene), an intergenic sequence (i.e. a sequence between genes), a 
^ sequence spanning at least a portion of two or more genes, a 5' noncoding region or a 3' 

% 15 noncoding region located upstream or downstream from the actual sequence that is 

fy required for bacterial proliferation or an operon containing a proliferation-required gene. 

□ In addition to therapeutic appHcations, the present invention encompasses the use 
^ of nucleic acid sequences complementary to sequences required for proliferation as 

□ diagnostic tools. For example, nucleic acid probes complementary to proliferation- 
Lj 20 required sequences that are specific for particular species of microorganisms can be used 

as probes to identify particular microorganism species in clinical specimens. This utility 
g provides a rapid and dependable method by which to identify the causative agent or agents 

of a bacterial infection. This utility would provide clinicians the ability to prescribe 
species specific antimicrobial compounds to treat such infections. In an extension of this 
25 utility, antibodies generated against proteins translated from mRNA transcribed from 

proliferation-required sequences can also be used to screen for specific microorganisms 
that produce such proteins in a species-specific manner. 

The following examples teach the genes of the present invention and a subset of 
uses for the E. coli genes identified as required for proUferation. These examples are 
30 illustrative only and are not intended to limit the scope of the present invention. 

EXAMPLES 
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The following examples are directed to the identification and exploitation of E. coli 
genes required for proliferation. Methods of gene identification are discussed as well as a 
variety of methods to utilize the identified sequences. 

5 Genes Identified as Required for Proliferation of coli 

Exogenous nucleic acid sequences were cloned into an inducible expression vector 
and assayed for growth inhibition activity. Example 1 describes the examination of a 
library of exogenous nucleic acid sequences cloned into IPTG-inducible expression 
vectors. Upon activation or induction, the expression vectors produced an RNA molecule 

1 0 corresponding to the subcloned exogenous nucleic acid sequences. The RNA product was 
in an antisense orientation with respect to the E. coli genes fi:om which it was originally 
derived. This antisense RNA then interacted with sense mRNA produced from various E. 
coli genes and interfered with or inhibited the translation of the sense messenger RNA 
(mRNA) thus preventing protein production from these sense mRNA molecules. In cases 

15 where the sense mRNA encoded a protein required for the proliferation, bacterial cells 
containing an activated expression vector failed to grow or grew at a substantially reduced 
rate. 

EXAMPLE 1 

Inhibition of Bacterial Proliferation after IPTG induction 
20 To study the effects of transcriptional induction in liquid medium, growth curves 

were carried out by back diluting cultures 1:200 into fresh media with or without 1 mM 
IPTG and measuring the OD450 every 30 minutes (min). To study the effects of 
transcriptional induction on solid medium, 10^ 10^ 10^ 10^ 10^ 10^ and 10^ fold 
dilutions of overnight cultures were prepared. Aliquots of from 0.5 to 3 j^l of these 
25 dilutions were spotted on selective agar plates with or without 1 mM IPTG. After 

overnight incubation, the plates were compared to assess the sensitivity of the clones to 
IPTG. 

Of the numerous clones tested, some clones were identified as a containing 
sequence that inhibited E. coli growth after IPTG induction. Accordingly, the gene to 
30 which the inserted nucleic acid sequence corresponds, or a gene within the operon 

containing the inserted nucleic acid, may be required for proliferation in E. coli. 
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Characterization of Isolated Clones Negatively Affecting £: coli Proliferation 

Following the identification of those expression vectors that, upon expression, 
negatively impacted E. coli growth or proliferation, the inserts or nucleic acid fragments 
contained in those expression vectors were isolated for subsequent characterization. 
5 Expression vectors of interest were subjected to nucleic acid sequence determination. 

EXAMPLE 2 

Nucleic Acid Sequence Determination of Identified Clones Expressing Nucleic Acid 
Fragments with Detrimental Effects of £ coli Proliferation 
The nucleotide sequences for the exogenous identified sequences were determined 
10 using plasmid DNA isolated using QIAPREP (Qiagen, Valencia, CA) and methods 
supplied by the manufacturer. The primers used for sequencing the inserts were 5' - 
TGTTTATCAGACCGCTT - 3' (SEQ ID NO: 403) and 5' - 
ACAATTTCACACAGCCTC - 3' (SEQ ID NO: 404). These sequences flank the 
polylinker in pLEXSBA. Sequence identification numbers (SEQ ID NOs) for the 
1 5 identified inserts are listed in Table I and discussed below. 

EXAMPLE 3 
Comparison Of Isolated Sequences to Known Sequences 
The nucleic acid sequences of the subcloned fi:agments obtained from the 
expression vectors discussed above were compared to known E. coli sequences in 
20 GenBank using BLAST version 1.4 or version 2.0.6 using the following default 
parameters: Filtering off, cost to open a gap=5, cost to extend a gap=2, penalty for a 
mismatch in the blast portion of run=-3, reward for a match in the blast portion of run=l, 
expectation value (e)=10.0, word size=ll, number of one-line descriptions=100, number 
of alignments to show (B)=100. BLAST is described in Altschul, J Mol Biol. 215:403-10 
25 (1990), the disclosure of which is incorporated herein by reference in its entirety. 

Expression vectors were found to contain nucleic acid sequences in both the sense and 
antisense orientations. The presence of known genes, open reading firames, and ribosome 
binding sites was determined by comparison to public databases holding genetic 
information and various computer programs such as the Genetics Computer Group 
30 programs FRAMES and CODONPREFERENCE. Clones were designated as "antisense" 

if the cloned firagment was oriented to the promoter such that the RNA transcript produced 
was complementary to the expressed mRNA firom a chromosomal locus. Clones were 
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designated as "sense" if they coded for an RNA fragment that was identical to a portion of 

a wild type mRNA from a chromosomal locus. 

The sequences described in Examples 1-2 that inhibited bacterial proliferation and 

contained gene fragments in an antisense orientation are listed in Table L This table Usts 
5 each identified sequence by: a sequence identification number; a Molecule Number; a gene 

to which the identified sequence corresponds, listed according to the National Center for 

Biotechnology Information (NCBI), Blattner (Science 277:1453-1474(1997); also 

contains the E. coli K-12 genome sequence), or Rudd (Micro, and MoL Rev. 62:985- 

1019 (1998)), (both papers are hereby incorporated by reference) nomenclatures. The 
1 0 CONTIG numbers for each identified sequence is shown, as well as the location of the first 

and last base pairs located on the E, coli chromosome. A Molecule Number with a 

indicates a clone corresponding to an intergenic sequence. 

The sequences of the nucleic acid inserts of SEQ ID NOs: 1-81 from U.S. 

Provisional Patent Application No. 60/1 17,405 which inhibited proliferation were fiarther 
15 analyzed. The reanalyzed sequences corresponding to SEQ ID NOs. 1-81 of U.S. 

Provisional Patent Application No. 60/117,405 have SEQ ID NOs. 405-485 in the 

present application. 

SEQ ID NOs: 82-242 in U.S. Provisional Patent Application No. 60/1 17,405 are 
identical to SEQ ID NOs: 82-242 of the present application with the following 

20 exceptions. SEQ ID NO: 148 in the present application is the complementary strand of 

SEQ ID NO: 148 in U.S. Provisional Patent Application No. 60/117,405. Accordingly, 
the protein of SEQ ID NO: 308 which is encoded by SEQ ID NO: 148 has also been 
revised. SEQ ID NO: 163 in the present application is the complementary strand of 
SEQ ID NO: 163 in U.S. Provisional Patent Application No. 60/1 17,405, Accordingly, 

25 the protein of SEQ ID NO: 323 which is encoded by SEQ ID NO: 163 has also been 
revised. 

The target gene of SEQ ID NOs. 18 and 19 of U.S. Provisional Patent 
Application No. 60/117,405 (SEQ ID NOs. 18, 19, 422, 423 of the present apphcation) 
has been revised from dicF to ftsZ to reflect the fact that these SEQ ID NOs. include 
30 natural antisense molecules which inhibit ftsZ expression. 

The gene products of the nucleic acids of SEQ ID NOs. 198 and 239-242 in U.S. 
Provisional Patent Application No. 60/117,405 and in the present application (SEQ ID 
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NOs. 358 and 399-402 of the present application) have been revised to reflect the fact 
that these nucleic acids encode nontranslated tRNAs and rRNAs. Tables I and II have 
been revised accordingly. The SEQ ID NOs. in Table II were also revised to reflect the 
fact that SEQ ID NOs: 89 and 402 were identical in U.S. Provisional Patent Application 
No. 60/117,405. 

TABLE I 

Identified Clones with Corresponding Genes and Operons 



SEQ 
ID 
NO. 


Molecule 
No. 


Gene 
(NCBI) 


Gene 
(Blattner) 


Gene 
(Rudd) 


CONTIG 


1,405 


EcXAOOl 


yhhQ 


b3471 


yhhQ 


AE000423 


2, 406 


EcXA002 


lepB 


lepB 


lepB 


AE000343 


3,407 


EcXA003 


f586 


b0955 


ycbZ 


AE000197 


4, 408 


EcXA004 


rpsG, rpsL 


b3341 


rpsG, 
rpsL 


AE000410 


5,409 


EcXA005a 


rplL, rplJ 


b3986 


rplL, rplJ 


AE000472 


6,410 


EcXA005b 


rplL 


rplL 


rplL 


AE000472 


7,411 


EcXA005c 


rplL, rplJ 


rplL, rplJ 


rplL, rplJ 


AE000472 


8,412 


EcXA005d 


rplL, rplJ 


rplL, rplJ 


rplL, rplJ 


AE000472 


9,413 


EcXA005e 


rplL 


rplL 


rplL 


AE000472 


10,414 


EcXA005f 


rplL 


rplL 


rplL 


AE000472 


11,415 


EcXA005g 


rplL 


rplL 


rplL 


AE000472 


12,416 


EcXA006 


pta 


b2297 


pta 


AE000319 


13,417 


EcXA007 


yicP 


b3666 


yicP 


AE000444 


14,418 


EcXA008a 


yhaU 


b3127 


yhaU 


AE000394 


15,419 


EcXA008b 


yhalJ 


yhaU 


yhalJ 


AE000394 


16,420 


EcXA008c 


yhaU 


yhaU 


yhaU 


AE000394 


17, 421 


EcXA009 


ydeY 


ydeY 


ydeY 


AE000249 


18,422 


EcXAOlOa 
(natural as) 


dicF 


bl575 


dicF 


AE000253 


19, 423 


EcXAOlOb 


dicF 


dicF 


dicF 


AE000253 


20, 424 


EcXAOll 


fdnG 


bl474 


fdnG 


AE000244 


21, 425 


EcXA012a 


fusA 


b3340 


fusA 


AE000410 


22, 426 


EcXA012b 


fusA 


fusA 


fusA 


AE000410 


23, 427 


EcXA012c 


fusA 


fusA 


fusA 


AE000410 


24, 428 


EcXA013a 


o86 


b2562 


yfhL 


AE000342 


25, 429 


EcXAOBb 


o86 


b2562 


yfhL 


AE000342 


26, 430 


EcXA013c 


o86 


b2562 


yfhL 


AE000342 


27, 431 


EcXAOM 


visC 


b2906 


visC 


AE000374 


28, 432 


EcXA015 


yfdl 


yfdl 


yfdl 


AE000323 


29, 433 


EcXA016 


yeaQ 


yeaQ 


yeaQ 


AE000274 






yoaG 


yoaG 


yoaG 
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SEQ 
ID 
NO. 


Molecule 
No. 


Gene 
(NCBI) 


Gene 
(Blattner) 


(Rudd) 




30, 434 


EcXA017a 




b2922 




AE000375 


31,435 


EcXA017b 


VS2E 




V22E 


AE000375 


32, 436 


EcXA018a 


o464 


b2074 


yegM 


AE000297 


33, 437 


EcXAOlSb 


o464 


b2074 




AE000297 


34, 438 


EcXA019a 


yehA 


yehA 


yehA 


AE000300 












AF0n0^9Q 


35, 439 


EcXA019b 


0 1 72 vphA 


n J 72 vphA 


o]72 
yehA 


AFOnO^QQ 

xw^xjkjxj ^ y y 


36, 440 


EcXA020 


o384 f82 


b 1 794, 
bl795 


yeciPy 
yeaQ 


AE000274 


37, 441 


EcXAOlla 


fin 


b0218 


yafU 


AE000130 


38, 442 


EcXA021b 


fill 


b0218 


yaJU 


AE000i30 


39, 443 


EcXA022 


o740 


bl629 


yu^i\ 


AF000258 


40, 444 


EcXA023a 


f 176^/382 


bl504, 
bl505 


ydeS, 
vdpT 


AE000247 


41, 445 


EcXA023b 


fl76 f382 


bI505 


vdpS 
ydeT 


AFf)nn947 


42, 446 


EcXA024 




b3082 






43, 447 


EcXA025 


02383 


bl878 


yeeJ 


AE000289 


44, 448 


EcXA026 


06 1 


dieted 


J lyiTiTP- 

dieted 


AF0001 18 


45, 449 


EcXA027a 


yohH 


yohH 


yohH 


AE000303 


46, 450 


EcXA027b 


yohH 


yohH 


yohH 


AE000303 


47, 451 


EcXA027c 


yohH 


yohH 


yohH 


AE000303 






yohl 


yohl 


yohl 




48, 452 


EcXA027d 


yohH 


yohH 


yohH 


AE000303 


49, 453 


EcXA028 


f296 


b2305 


vfcl 


AE000319 


50, 454 


EcXA029 


yJJK 


b4391 


yjJK 


AE000509 


51,455 


EcXA030 


yiSA 


b3557 


yiSA 


AE000433 


52, 456 


EcXA03 1 


rplE 


B3308 


nil 




53, 457 


EcXA032a 










54, 458 


EcXA032b 
** 


vb2D 


vb^D 


vh^D 
yu^Lj 


AFOOOIT'S 








2UA 


2ltA 




55, 459 


EcXA033a 


f477 (as) 


b3052 


wauE 














AE000386 


56, 460 


EcXA033b 


f477 


b3052 

V \y fc> Am* 


wqclK 

TV K/ti^^J^ 


AE000387 


57, 461 


EcXA034a 


cspA 


b3556 


cspA 


AE000433 


58, 462 


EcXA034b 


cspA 


b3556 


cspA 


AE000433 


59, 463 


EcXA035 


yhju 


yhjU 


yhjU 


AE000431 


60, 464 


EcXA036 


yqjF 


bSlOl 


yqjF 


AE000392 






o99 


bSlOO, 


yqjK 




61, 465 


EcXA037 


ydeH 


bl535 


ydeH 


AE000251 
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wo 
iri 

XJL/ 

NO 


ivxoiecuic 


-_. 


Gene 
(rSlattner) 


Gene 
(Kuda) 


CONTIG 


uz., too 




sieB 


A 7 3 ^' 3 


sieB 


AE000233 






yoou 




ybbD 


At000156 








oj44j 


insB_o 


A T7 A A A /I 0 

Ajb000420 


UJ, T-O!? 






oil JO 


ymfE 


AE000214 


OO5 *T / U 




rpLi 


««7 V 

rpii 


rplY 


A A A A AO 

AE000308 


67, 471 


EcXA042b 


rplY 


rplY 


rplY 


AE000308 
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EXAMPLE 4 

Identification of Genes and their Corresponding Operons Affected by Antisense Inhibition 
The sequencing of the entire E. coli genome is described in Blattner et al. Science 

5 277:1453-1474(1997) the entirety of which is hereby incorporated by reference and the 

sequence of the genome is listed in GenBank Accession No.U00096, the disclosure of 
which is incorporated herein by reference in its entirety. The operons to which the 
proUferation-mhibiting nucleic acids correspond were identified using RegulonDB and 
information in the literature. The coordinates of the boundaries of these operons on the E. 

0 coli genome are listed in Table IIL Table II lists the molecule numbers of the inserts 
containing the growth inhibiting nucleic acid fragments, the genes in the operons 
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corresponding to the inserts, the SEQ ID NOs of the genes containing the inserts, the SEQ 
ID NOs of the proteins encoded by the genes, the start and stop points of the genes on the 
E. coH genome, the orientation of the genes on the genome, whether the operons are 
predicted or documented, and the predicted functions of the genes. The identified operons, 
their putative functions, and whether or not the genes are presently thought to be required 
for proUferation are discussed below. 

Functions for the identified genes were determined by using either Blattner 
functional class designations or by comparing identified sequence with known sequences 
in various databases. A variety of biological fimctions were noted for the genes to which 
the clones of the present invention correspond. The functions for the genes of interest 
appear in Table II. 

The proteins that are listed in Table II are involved in a wide range of biological 
functions. 
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translational 

modification 


Translation, post- 

translational 

modification 


Translation, post- 

translational 

modification 


Translation, post- 
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Predicted functional 
class of encoded 
proteins 


















Lysis protein 










Blattner functional 
class of encoded 
proteins 


Cell structure 


Cell structure 


Cell structure 


Cell structure 


Cell structure 


Hypothetical ORF, 

unclassified, 

unknown 


Hypothetical ORF, 

unclassified, 

unknown 


Hypothetical ORF, 

unclassified, 

unknown 


Hypothetical ORF, 

unclassified, 

unknown 


Cell processes (incl. 

Adaptation, 

protection) 


Phage, transposon, or 
plasmid 


Putative enzymes 


Hypothetical ORF, 
unclassified. 
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4545301 


4545841 
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1638684 
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2101411 


2102531 
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2100938 


2101413 


Genes On 
Operon 


fimC 


fimD 


fimF 


fimG 


fimH . 






1 


1 










Mole. No. 












EcXA047 










EcXA048 






Gene 
Prod. Seq 
ID No. 


en 

CO 


^ 

CO 


in 

CO 


CO 


CO 


00 

VO 

CO 


ON 
CO 


o 

CO 


CO 


CN 
l> 
CO 


CO 
CO 


CO 


in 

CO 


GeneSeq 
ID 
No. 


CO 

o 

(N 


o 


m 
o 

<N 


o 

CN 


o 

(N 


oo 
o 

<N 


ON 

o 

CN 


o 




(N 


CO 




in 

CN 



in 



Predicted functional 
class of encoded 
proteins 










UDP galacto- 
pyranase mutase 


Unknown 


Putative auto- 
transporter 


Hypothetical outer 
membrane protein 






Blattner functional 
class of encoded 
proteins 


unknown 


Putative enzymes 


Hypothetical ORF, 

unclassified, 

unknown 


Cell structure 


Hypothetical ORF, 

unclassified, 

unknown 


Hypothetical ORF, 

unclassified, 

unknown 


Putative enzymes 


Hypothetical ORF, 

unclassified, 

unknown 


Translation, post- 

translational 

modification 


Translation, post- 

translational 

modification 
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2103106 


2104079 


2105248 


2106351 


404042 


393642 


394353 


3446205 

1 
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3446396 
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Coordinate 




2102516 


2103087 


2104082 


2105248 


402927 


392239 


393685 


3445951 
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class of encoded 
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Translation, post- 

translational 
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Translation, post- 

translational 

modification 


Translation, post- 
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modification 


Translation, post- 

translational 

modification 


Translation, post- 

translational 

modification 


Translation, post- 

translational 

modification 


Translation, post- 

translational 

modification 


Translation, post- 

translational 

modification 
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Right 
Coordinate 


3446806 


3447520 


3447870 


3448163 


3449001 


3449321 


3449923 


3450563 
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3446819 


3447538 


3447885 


3448180 


3449019 


3449318 


3449934 
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Translation 




Glutathione oxido- 
reductase 




Translation (rRNA) 


Blattner functional 
class of encoded 
proteins 


Translation, post- 
translational 
modification j 


Translation, post- 

translational 
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Translation, post- 

translational 
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Translation, post- 

translational 

modification 


Translation, post- 

translational 
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Biosynthesis of 
cofactors, prosthetic 
groups and carriers 
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translational 
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1797773 
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1798666 
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Translation (rRNA) 


Translation (tRNA) 


Translation (rRNA) 


Blattner functional 
class of encoded 
proteins 


Translation, post- 

translational 

modification 


Translation, post- 

translational 

modification 


Translation, post- 

translational 

modification 


Predicted 
(P)Or 
Documented 

(D) 
Operon 








Right 
Coordinate 


2724208 


2727464 


2729178 
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Coordinate 


2724089 


2727389 


1 2727636 


Genes On 
Operon 
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Several of the expression vectors contain fragments that correspond to genes of 
unknown function or if the function is known, it is not known whether the gene is 
essential. For example, EcXAOOl, 003, 007, 008, 013, 015, 016, 017, 018, 019, 020, 021, 
022, 023, 024, 025, 026, 027, 028, 029, 030, 032, 033, 034, 035, 036, 037, 038, 039, 040, 
5 041, 047, 048, 049 and 050 are all exogenous nucleic acid sequences that correspond to E, 

coli proteins that have no known function or where the function has not been shown to be 
essential or nonessential. 

The present invention reports a number of novel E, coli genes and operons that are 
required for proUferation. From the list clone sequences identified here, each was 

10 identified to be a portion of a gene in an operon required for the proliferation of E, coll 

Cloned sequences corresponding to genes already known to be required for proliferation in 
£ coli include EcXA002, 004, 005, 010, 012, 014, 031, 02, 043, 045, 051, 052, 054, and 
055. The remaining identified sequences correspond to E. coli genes previously 
undesignated as required for proliferation in the art. 

15 An interesting observation of the present invention is that there are also several 

sequence fragments that correspond to E, coli genes that are not thought to be required for 
E. coli proliferation. Nevertheless, under the conditions described above, the antisense 
expression of these gene fragments causes a reduction in cell growth. This result implies 
that the genes corresponding to the identified sequences are actually required for 

20 proliferation. Molecule Nos. corresponding to these genes are EcXA006, 044, 046, and 

053. 

Following identification of the sequences of interest, these sequences were 
localized into operons. Since bacterial genes are expressed in a polycistronic manner, the 
antisense inhibition of a single gene in an operon might effect the expression of all the 
25 other genes on the operon or the genes down stream from the single gene identified. In 

order to determine which of the gene products in an operon are required for proliferation, 
each of the genes contained within an operon may be analyzed for their effect on viability 
as described below. 
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TABLE III 



Operon Boundaries 



Mole. No. 


Left 


Right 




Coordinate 


Coordinate 


EcXAOOl 


3606848 


3608143 


EcXA002 


2702355 


2703329 


EcXAOOS 


1015762 


1017522 


EcXA004 


3467782 


3472189 


EcXAOOS 


4177574 


4178503 


EcXA006 


2412767 


2414911 


EcXAOO? 


3841591 


3843357 


EcXA008 


3268266 


3272548 


EcXA009 


1599514 


1605313 


EcXAOlO 


1647406 


1647458 


EcXAOll 


1545425 


1550015 


EcXA012 


3467782 


3472189 


EcXA013 


2697683 


2697943 


EcXA014 


3049135 


3053470 


EcXAOlS 


2465875 


2468482 


EcXA016 


1877031 


1877972 


EcXAOl? 


3065360 


3066100 


EcXAOlS 


2151891 


2160901 


EcXAOl 9 


2185400 


2190242 


EcXA020 


1877031 


1877972 


EcXA021 


238746 


239084 


EcXA022 


1703791 


1710182 


EcXA023 


1585817 


1588025 


EcXA024 


3231369 


3232096 


EcXA025 


2042885 


2050036 


EcXA026 


331001 


331184 


EcXA027c 


2225343 


2228405 


EcXA028 


2420669 


2421559 


EcXA029 


4626424 


4628091 


EcXA030 


3718309 


3719678 


EcXA031 


3440255 


3445786 


EcXA032b 


751452 


753691 


EcXA033 


3192961 


3198606 


EcXA034 


3717678 


3717890 


EcXA035 


3694087 


3697522 


EcXA036 


3246594 


3248594 


EcXA037 


1620984 


1621874 


EcXA038 


1416572 


1417368 


EcXA039 


522485 


528354 


EcXA040 


3580669 


3580672 


EcXA041 


1196090 


1197460 
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Mole. No. 


Left 
Coordinate 


Right 
Coordinate 


EcXA042 


2280537 


2280821 


EcXA043 


765207 


773404 


EcXA044 


1189839 


1191209 


EcXA045 


2816575 


2817168 


EcXA046 


4538525 


4547279 


EcXA047 


1637054 


1639578 


EcXA048 


2099917 


2106351 


EcXA049 


402927 


404042 


EcXAOSO 


392239 


394353 


EcXAOSl 


3445951 


3450907 


EcXA052 


1797417 


1800594 


EcXA053 


3643929 


3645281 


EcXA054 


3440255 


3445786 


EcXA055 


2724301 


2729178 



EXAMPLE 5 

Identification of Individual Genes within an Operon Required for Proliferation 
The following example illustrates a method for determining which gene in an 

5 operon is required for proliferation. The clone insert corresponding to Molecule No. 

EcXA004 possesses nucleic acid sequence homology to the E, coli genes rspG and 
rspL. This molecule corresponds to an operon containing two additional genes fusA and 
tufA. The rpsL gene is the first gene in the operon. To determine which gene or genes 
in this operon are required for proliferation, each gene is selectively inactivated using 

10 homologous recombination. Gene rpsL is the first gene to be inactivated. 

Deletion inactivation of a chromosomal copy of a gene in E, coli can be 
accomplished by integrative gene replacement. The principle of this method (Hamilton, 
C, M., et al 1989. 1 Bacteriol 171: 4617-4622) is to construct a mutant allele of the 
targeted gene, introduce that allele into the chromosome using a conditional suicide 

15 vector, and then force the removal of the native wild type allele and vector sequences. 

This will replace the native gene with a desired mutation(s) but leave promoters, 
operators, etc. intact. Essentiality of a gene is determined either by deduction from 
genetic analysis or by conditional expression of a wild type copy of the targeted gene 
(trans complementation). 

20 The first step is to generate a mutant rpsL allele using PGR amplification. Two 

sets of PGR primers are chosen to produce a copy of rpsL with a large central deletion 
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to inactivate the gene. In order to eliminate polar effects, it is desirable to construct a 
mutant allele comprising an in-frame deletion of most or all of the coding region of the 
rpsL gene. Each set of PGR primers is chosen such that a region flanking the gene to be 
amplified is sufficiently long to allow recombination (typically at least 500 nucleotides 
5 on each side of the deletion). The targeted deletion or mutation will be contained within 

this fragment. To facilitate cloning of the PGR product, the PGR primers may also 
contain restriction endonuclease sites found in the cloning region of a conditional 
knockout vector such as pK03 (Link, et al 1997 J. Bacteriol 179 (20): 6228-6237). 
Suitable sites include NotI, Sail, BamHI and Smal. The rpsL gene fragments are 

10 produced using standard PGR conditions including, but not limited to^ those outlined in 

the manufacturers directions for the Hot Start Taq PGR kit (Qiagen, Inc., Valencia, 
CA). The PGR reactions will produce two fragments that can be fused together. 
Alternatively, crossover PGR can be used to generate a desired deletion in one step (Ho, 
S. N., et al 1989. Gene 11: 51-59, Horton, R. M., et al 1989. Gene 11: 61-68). The 

15 mutant allele thus produced is called a "null" allele because it cannot produce a 

functional gene product. 

The mutant allele obtained from PGR amplification is cloned into the multiple 
cloning site of pK03. Directional cloning of the rpsL null allele is not necessary. The 
pK03 vector has a temperature-sensitive origin of replication derived from pSGlOl. 

20 Therefore, clones are propagated at the permissive temperature of 30°G, The vector 

also contains two selectable marker genes: one that confers resistance to 
chloramphenicol and another, the Bacillus subtilis sacB gene, that allows for counter- 
selection on sucrose containing growth medium. Glones that contain vector DNA with 
the null allele inserted are confirmed by restriction endonuclease analysis and DNA 

25 sequence analysis of isolated plasmid DNA. The plasmid containing the rpsL null allele 
insert is known as a knockout plasmid. 

Once the knockout plasmid has been constructed and its sequence verified, it is 
transformed into a Rec"^ E. coli host cell. Transformation can be by any standard 
method such as electroporation. In some fraction of the transformed cells, plasmids will 

30 integrate into the E, coli chromosome by homologous recombination between the rpsL 

null allele in the plasmid and the rpsL gene in the chromosome. Transformant colonies 
in which such an event has occurred are readily selected by growth at the non- 
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peraiissive temperature of 43°C and in the presence of choramphenicoL At this 
temperature, the plasmid will not replicate as an episome and will be lost from cells as 
they grow and divide. These cells are no longer resistant to chloramphenicol and will 
not grow when it is present. However, cells in which the knockout plasmid has 
5 integrated into the coli chromosome remain resistant to chloramphenicol and 

propagate. 

Cells containing integrated knock-out plasmids are usually the result of a single 
crossover event that creates a tandem repeat of the mutant and native wild type alleles 
ofrpsL separated by the vector sequences. A consequence of this is that rpsL will still 

10 be expressed in these cells. In order to determine if the gene is essential for growth, the 
wild type copy must be removed. This is accomplished by selecting for plasmid 
excision, a process in which homologous recombination between the two alleles results 
in looping out of the plasmid sequences. Cells that have undergone such an excision 
event and have lost plasmid sequences including sacB gene are selected for by addition 

15 of sucrose to the medium. The sacB gene product converts sucrose to a toxic molecule. 

Thus counter selection with sucrose ensures that plasmid sequences are no longer 
present in the cell. Loss of plasmid sequences is further confirmed by testing for 
sensitivity to chloramphenicol (loss of the chloramphenicol resistance gene). The latter 
test is important because occasionally a mutation in the sacB gene can occur resulting in 

20 a loss of sacB function with no effect on plasmid replication (Link, et. al., 1997 J, 

Bacteriol 179 (20): 6228-6237). These artifact clones retain plasmid sequences and are 
therefore still resistant to chloramphenicol. 

In the process of plasmid excision, one of the two rpsL alleles is lost from the 
chromosome along with the plasmid DNA. In general, it is equally likely that the null 

25 allele or the wild type allele will be lost. Therefore, if the rpsL gene is not essential, 

half of the clones obtained in this experiment will have the wild type allele on the 
chromosome and half will have the null allele. However, if the rpsL gene is essential, 
cells containing the null allele will not be obtained as a single copy of the null allele 
would be lethal. 

30 To determine the essentiality of rpsL, a statistically significant number of the 

resulting clones, at least 20, are analyzed by PCR amplification of the rpsL gene. Since 
the null allele is missing a significant portion of the rpsL gene, its PCR product is 



-61- 



significantly shorter than that of the wild type gene and the two are readily 
distinguished by gel electrophoretic analysis. The PGR products may also be subjected 
to sequence determination for fiirther confirmation by methods well known to those in 
the art. 

5 The above experiment is generally adequate for determining the essentiality of a 

gene such as rpsL, However, it may be necessary or desirable to more directly confirm 
the essentiality of the gene. There are several methods by which this can be 
accomplished. In general, these involve three steps: 1) construction of an episome 
containing a wild type allele, 2) isolation of clones containing a single chromosomal 

10 copy of the mutant null allele as described above but in the presence of the episomal 

wild type allele, and then 3) determining if the cells survive when the expression of the 
episomal allele is shut off In this case, the trans copy of wild type rpsL is made by 
PGR cloning of the entire coding region of rpsL and inserting it in the sense orientation 
downstream of an inducible promoter such as the E. coli lac promoter. Transcription of 

15 this allele of rpsL will be induced in the presence of IPTG which inactivates the lac 

repressor. Under IPTG induction rpsL protein will be expressed as long as the 
recombinant gene also possesses a ribosomal binding site, also known as a "Shine- 
Dalgamo Sequence", The trans copy of rpsL is cloned on a plasmid that is compatible 
with pSGlOl. Gompatible vectors include pl5A, pBR322, and the pUG plasmids, 

20 among others. Replication of the compatible plasmid will not be temperature-sensitive. 

The entire process of integrating the null allele of rpsL and subsequent plasmid excision 
is carried out in the presence of IPTG to ensure the expression of functional rpsL 
protein is maintained throughout. After the null rpsL allele is confirmed as integrated 
on the chromosome in place of the wild type rpsL allele, then IPTG is withdrawn and 

25 expression of functional rpsL protein shut off If the rpsL gene is essential, cells will 

cease to proliferate under these conditions. However, if the rpsL gene is not essential, 
cells will continue to proliferate under these conditions. In this experiment, essentiality 
is determined by conditional expression of a wild type copy of the gene rather than 
inability to obtain the intended chromosomal disruption. 

30 An advantage of this method over some other gene disruption techniques is that 

the targeted gene can be deleted or mutated without the introduction of large segments 
of foreign DNA. Therefore, polar effects on downstream genes are eliminated or 
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minimized. There are methods described to introduce inducible promoters upstream of 
potential essential bacterial genes. However in such cases, polarity from multiple 
transcription start points can be a problem. One way of preventing this is to insert a 
gene disruption cassette that contains strong transcriptional terminators upstream of the 
5 integrated inducible promoter (Zhang, Y, and Cronan, J. E. 1996 J, Bacteriol 178 (12): 

3614-3620). The described techniques will all be familiar to one of ordinary skill in the 
art. 

Following the analysis of the rpsL gene, the other genes of the operon are 

investigated to determine if they are required for proliferation. 

10 EXAMPLE 6 

Expression of the Proteins Encoded by Genes Identified as Required for E. coli 

Proliferation 

The following is provided as one exemplary method to express the proliferation- 
required proteins encoded by the identified sequences described above. First, the initiation 

15 and termination codons for the gene are identified. If desired, methods for improving 

translation or expression of the protein are well known in the art. For example, if the 
nucleic acid encoding the polypeptide to be expressed lacks a methionine codon to serve as 
the initiation site, a strong Shine-Delgamo sequence, or a stop codon, these sequences can 
be added. Similarly, if the identified nucleic acid sequence lacks a transcription 

20 termination signal, this sequence can be added to the construct by, for example, splicing 
out such a sequence fi-om an appropriate donor sequence. In addition, the coding sequence 
may be operably linked to a strong promoter or an inducible promoter if desired. The 
identified nucleic acid sequence or portion thereof encoding the polypeptide to be 
expressed is obtained by PGR fi:om the bacterial expression vector or genome using 

25 oligonucleotide primers complementary to the identified nucleic acid sequence or portion 

thereof and containing restriction endonuclease sequences for Ncol incorporated into the 5' 
primer and BgRl at the 5' end of the corresponding 3 '-primer, taking care to ensure that the 
identified nucleic acid sequence is positioned in fi*ame with the termination signal The 
purified fi-agment obtained fi*om the resulting PGR reaction is digested with Ncol and 

30 BgRl, purified and ligated to an expression vector. 

The ligated product is transformed into DH5a or some other E. coli strain suitable 
for the over expression of potential proteins. Transformation protocols are well known in 
the art. For example, transformation protocols are described in; Current Protocols in 
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Molecular Biology, Vol. 1, Unit 1.8, (Ausubel, et al., Eds.) John Wiley & Sons, Inc. 
(1997). Positive transformants are selected after growing the transformed cells on plates 
containing 50-100 jig/ml Ampicillin (Sigma, St. Louis, Missouri). In one embodiment, the 
expressed protein is held in the cytoplasm of the host organism. In an alternate 
5 embodiment, the expressed protein is released into the culture medium. In still another 

alternative, the expressed protein can be sequestered in the periplasmic space and liberated 
therefrom using any one of a number of cell lysis techniques known in the art. For 
example, the osmotic shock cell lysis method described in Chapter 16 of Current 
Protocols in Molecular Biology, Vol, 2, (Ausubel, et al., Eds.) John Wiley & Sons, Inc. 

1 0 (1997). Each of these procedures can be used to express a proliferation-required protein. 

Expressed proteins, whether in the culture medium or liberated from the 
periplasmic space or the cytoplasm, are then purified or enriched from the supernatant 
using conventional techniques such as ammonium sulfate precipitation, standard 
chromatography, immunoprecipitation, immunochromatography, size exclusion 

15 chromatography, ion exchange chromatography, and HPLC. Alternatively, the secreted 

protein can be in a sufficiently enriched or pure state in the supernatant or growth media of 
the host to permit it to be used for its intended purpose without further enrichment. The 
purity of the protein product obtained can be assessed using techniques such as Coomassie 
or silver staining or using antibodies against the control protein. Coomassie and silver 

20 staining techniques are familiar to those skilled in the art. 

Antibodies capable of specifically recognizing the protein of interest can be 
generated using synthetic peptides using methods well known in the art. See, Antibodies: 
A Laboratory Manual, (Harlow and Lane, Eds.) Cold Spring Harbor Laboratory (1988). 
For example, 15-mer peptides having a sequence encoded by the appropriate identified 

25 gene sequence of interest or portion thereof can be chemically synthesized. The synthetic 

peptides are injected into mice to generate antibodies to the polypeptide encoded by the 
identified nucleic acid sequence of interest or portion thereof Alternatively, samples of 
the protein expressed from the expression vectors discussed above can be purified and 
subjected to amino acid sequencing analysis to confirm the identity of the recombinantly 

30 expressed protein and subsequently used to raise antibodies. An Example describing in 

detail the generation of monoclonal and polyclonal antibodies appears in Example 7. 
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The protein encoded by the identified nucleic acid sequence of interest or portion 
thereof can be purified using standard immunochromatography techniques. In such 
procedures, a solution containing the secreted protein, such as the culture medium or a cell 
extract, is applied to a column having antibodies against the secreted protein attached to 
5 the chromatography matrix. The secreted protein is allowed to bind the 
immunochromatography column. Thereafter, the column is washed to remove non- 
specifically bound proteins. The specifically bound secreted protein is then released fi:om 
the column and recovered using standard techniques. These procedures are well known in 
the art, 

10 In an alternative protein purification scheme, the identified nucleic acid sequence 

of interest or portion thereof can be incorporated into expression vectors designed for use 
in purification schemes employing chimeric polypeptides. In such strategies the coding 
sequence of the identified nucleic acid sequence of interest or portion thereof is inserted in- 
fi-ame v^th the gene encoding the other half of the chmiera. The other half of the chimera 

15 can be maltose binding protein (MBP) or a nickel binding polypeptide encoding sequence. 

A chromatography matrix having antibody to MBP or nickel attached thereto is then used 
to purify the chimeric protein. Protease cleavage sites can be engineered between the 
MBP gene or the nickel binding polypeptide and the identified expected gene of interest, 
or portion thereof Thus, the two polypeptides of the chimera can be separated from one 

20 another by protease digestion. 

One useful expression vector for generating maltose binding protein fusion 
proteins is pMAL (New England Biolabs), which encodes the malE gene. In the pMal 
protein fusion system, the cloned gene is inserted into a pMal vector downstream from the 
malE gene. This results in the expression of an MBP-fusion protein. The fusion protein is 

25 purified by affinity chromatography. These techniques as described are well known to 
those skilled in the art of molecular biology. 

EXAMPLE 7 
Production of an Antibody to an isolated E. coli Protein 
Substantially pure protein or polypeptide is isolated from the transformed cells as 

30 described in Example 6. The concentration of protein in the final preparation is adjusted, 

for example, by concentration on a 10,000 molecular weight cut off AMICON filter device 
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(Millipore, Bedford, MA), to the level of a few micrograms/ml. Monoclonal or polyclonal 
antibody to the protein can then be prepared as follows: 

Monoclonal Antibody Production by Hybridoma Fusion 
Monoclonal antibody to epitopes of any of the peptides identified and isolated as 
5 described can be prepared from murine hybridomas according to the classical method of 

Kohler, G. and Milstein, Nature 256:495 (1975) or any of the well-known derivative 
methods thereof Briefly, a mouse is repetitively inoculated with a few micrograms of the 
selected protein or peptides derived therefrom over a period of a few weeks. The mouse is 
then sacrificed, and the antibody producing cells of the spleen isolated. The spleen cells 

10 are fused by means of polyethylene glycol with mouse myeloma cells, and the excess 

unfused cells destroyed by growth of the system on selective media comprising 
aminopterin (HAT media). The successfully fiised cells are diluted and aliquots of the 
dilution placed in wells of a microtiter plate where growth of the culture is continued. 
Antibody-producing clones are identified by detection of antibody in the supernatant fluid 

15 of the wells by immunoassay procedures, such as ELISA, as described by Engvall, E., 

"Enzyme immunoassay ELISA and EMIT," Meth, EnzymoL 70:419 (1980), and 
derivative methods thereof Selected positive clones can be expanded and their 
monoclonal antibody product harvested for use. Detailed procedures for monoclonal 
antibody production are described in Davis, L. et al. Basic Methods in Molecular Biology 

20 Elsevier, New York. Section 21-2, 

Polyclonal Antibody Production by Immunization 

Polyclonal antiserum containing antibodies to heterogeneous epitopes of a single 
protein or a peptide can be prepared by immunizing suitable animals with the expressed 
protein or peptides derived therefrom described above, which can be unmodified or 

25 modified to enhance immunogenicity. Effective polyclonal antibody production is 

affected by many factors related both to the antigen and the host species. For example, 
small molecules tend to be less immxmogenic than larger molecules and can require the use 
of carriers and adjuvant. Also, host animals vary in response to site of inoculations and 
dose, with both inadequate or excessive doses of antigen resulting in low titer antisera. 

30 Small doses (ng level) of antigen administered at multiple intradermal sites appears to be 

most reliable. An effective immunization protocol for rabbits can be found in Vaitukaitis, 
J. et al. J. Clin. Endocrinol Metab. 33:988-991 (1971). 
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Booster injections can be given at regular intervals, and antiserum harvested when 
antibody titer thereof, as determined semi-quantitatively, for example, by double 
immunodiffusion in agar against known concentrations of the antigen, begins to fall. See, 
for example, Ouchterlony, O. et al.. Chap. 19 in: Handbook of Experimental 
5 Immunology D. Wier (ed) Blackwell (1 973). Plateau concentration of antibody is usually 

in the range of 0.1 to 0.2 mg/ml of serum (about 12 |aM). Affmity of the antisera for the 
antigen is determined by preparing competitive binding curves, as described, for example, 
by Fisher, D., Chap. 42 in: Manual of Clinical Immunology, 2d Ed. (Rose and Friedman, 
Eds.) Amer. Soc. For Microbiol, Washington, D.C. (1980). 

10 Antibody preparations prepared according to either protocol are useful in 

quantitative immunoassays which determine concentrations of antigen-bearing substances 
in biological samples; they are also used semi-quantitatively or qualitatively to identify the 
presence of antigen in a biological sample. The antibodies can also be used in therapeutic 
compositions for killing bacterial cells expressing the protein. 

15 EXAMPLES 

Screening Chemical Libraries 

A. Protein-Based Assays 

Having isolated and expressed bacterial proteins shown to be required for bacterial 
proliferation, the present invention further contemplates the use of these expressed proteins 

20 in assays to screen libraries of compounds for potential drug candidates. The generation of 

chemical libraries is well known in the art. For example combinatorial chemistry can be 
used to generate a library of compounds to be screened in the assays described herein. A 
combinatorial chemical library is a collection of diverse chemical compounds generated by 
either chemical synthesis or biological synthesis by combining a number of chemical 

25 "building blocks" reagents. For example, a linear combinatorial chemical library such as a 

polypeptide library is formed by combining amino acids in every possible combination to 
yield peptides of a given length. Millions of chemical compounds theoretically can be 
synthesized through such combinatorial mixings of chemical building blocks. For 
example, one commentator observed that the systematic, combinatorial mixing of 100 

30 interchangeable chemical building blocks results in the theoretical synthesis of 100 million 

tetrameric compounds or 10 billion pentameric compounds. (Gallop et al., "Applications 
of Combinatorial Technologies to Drug Discovery, Background and Peptide 
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Combinatorial Libraries" Journal of Medicinal Chemistry, Vol. 37, No. 9, 1233-1250 
(1994). Other chemical libraries known to those in the art may also be used, including 
natural product libraries. 

Once generated, combinatorial libraries can be screened for compounds that 
5 possess desirable biological properties. For example, compounds which may be useful as 

drugs or to develop drugs would likely have the ability to bind to the target protein 
identified, expressed and purified as discussed above. Further, if the identified target 
protein is an enzyme, candidate compounds would likely interfere with the enzymatic 
properties of the target protein. Any enzyme can be a target protein. For example, the 

10 enzymatic function of a target protein can be to serve as a protease, nuclease, phosphatase, 
dehydrogenase, transporter protein, transcriptional enzyme, and any other type of 
enzyme known or unknown. Thus, the present invention contemplates using the protein 
products described above to screen combinatorial chemical libraries. 

Those in the art will appreciate that a number of techniques exist for characterizing 

15 target proteins in order to identify molecules useful for the discovery and development of 

therapeutics. For example, some techniques involve the generation and use of small 
peptides to probe and analyze target proteins both biochemically and genetically in order 
to identify and develop drug leads. Such techniques include the methods described in PCT 
publications No. W09935494, W09819162, W09954728, the disclosures of which are 

20 incorporated herein by reference in their entireties. 

In another example, the target protein is a serine protease and the substrate of the 
enzyme is known. The present example is directed towards the analysis of libraries of 
compounds to identify compounds that function as inhibitors of the target enzyme. First, a 
library of small molecules is generated using methods of combinatorial library formation 

25 well known in the art. U.S. Patent NOs. 5,463,564 and 5,574, 656, to Agrafiotis, et al,, 
entitled "System and Method of Automatically Generating Chemical Compound with 
Desired Properties," are two such teachings. Then the library compounds are screened to 
identify library compounds that possess desired structural and functional properties. U.S. 
Patent No. 5,684,71 1 also discusses a method for screening libraries. 

30 To illustrate the screening process, the combined target and chemical compounds 

of the library are exposed to and permitted to interact with the purified enzyme. A labeled 
substrate is added to the incubation. The label on the substrate is such that a detectable 
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signal is emitted from metabolized substrate molecules. The emission of this signal 
permits one to measure the effect of the combinatorial library compounds on the 
enzymatic activity of target enzymes. The characteristics of each library compound is 
encoded so that compounds demonstrating activity against the enzyme can be analyzed 
5 and features common to the various compounds identified can be isolated and combined 
into future iterations of libraries. 

Once a library of compounds is screened, subsequent libraries are generated using 
those chemical building blocks that possess the features shown in the first round of screen 
to have activity against the target enzyme. Using this method, subsequent iterations of 
10 candidate compounds will possess more and more of those structural and functional 
features required to inhibit the function of the target enzyme, until a group of enzyme 
inhibitors with high specificity for the enzyme can be found. These compounds can then 
be further tested for their safety and efficacy as antibiotics for use in mammals. 

y It will be readily appreciated that this particular screening methodology is 

^ 15 exemplary only. Other methods are well known to those skilled in the art. For 

example, a wide variety of screening techniques are known for a large number of 

'j" naturally-occurring targets when the biochemical function of the target protein is 

^ known. 

L B. Cell Based Assays 

20 Current cell-based assays used to identify or to characterize compounds for drug 

LJi discovery and development frequently depend on detecting the ability of a test 

y compound to inhibit the activity of a target molecule located within a cell or located on 

the surface of a cell. Most often such target molecules are proteins such as enzymes, 
receptors and the like. However, target molecules may also include other molecules 
25 such as DNAs, lipids, carbohydrates and RNAs including messenger RNAs, ribosomal 

RNAs, tRNAs and the like, A number of highly sensitive cell-based assay methods are 
available to those of skill in the art to detect binding and interaction of test compounds 
with specific target molecules. However, these methods are generally not highly 
effective when the test compoimd binds to or otherwise interacts with its target 
30 molecule with moderate or low affinity. In addition, the target molecule may not be 
readily accessible to a test compound in solution, such as when the target molecule is 
located inside the cell or within a cellular compartment such as the periplasm of a 
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bacterial cell. Thus, current cell-based assay methods are limited in that they are not 
effective in identifying or characterizing compounds that interact with their targets with 
moderate to low affinity or compounds that interact with targets that are not readily 
accessible. 

Cell-based assay methods of the present invention have substantial advantages 
over current cell-based assays practiced in the art. These advantages derive from the 
use of sensitized cells in which the level or activity of a proliferation-required gene 
product (the target molecule) has been specifically reduced to the point where the 
presence or absence of its function becomes a rate-determining step for cellular 
proliferation. Bacterial, fungal, plant, or animal cells can all be used with the present 
method. Such sensitized cells become much more sensitive to compounds that are 
active against the affected target molecule. Thus, cell-based assays of the present 
invention are capable of detecting compounds exhibiting low or moderate potency 
against the target molecule of interest because such compounds are substantially more 
potent on sensitized cells than on non-sensitized cells. The affect may be such that a 
test compound may be two to several times more potent, at least 10 times more potent 
or even at least 100 times more potent when tested on the sensitized cells as compared 
to the non-sensitized cells. 

Due in part to the increased appearance of antibiotic resistance in pathogenic 
microorganisms and to the significant side-effects associated with some currently used 
antibiotics, novel antibiotics acting at new targets are highly sought after in the art. Yet, 
another limitation in the current art related to cell-based assays is the problem of 
identifying hits against the same kinds of target molecules in the same limited set of 
biological pathways over and over again. This may occur when compounds acting at 
such new targets are discarded, ignored or fail to be detected because compounds acting 
at tiie "old" targets are encountered more fi:equently and are more potent than 
compounds acting at the new targets. As a resuU, the majority of antibiotics in use 
currentiy interact with a relatively small number of target molecules within an even 
more limited set of biological pathways. 

The use of sensitized cells of the current invention provides a solution to the 
above problem in two ways. First, desired compounds acting at a target of interest, 
whether a new target or a previously known but poorly exploited target, can now be 
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detected above the ''noise" of compounds acting at the "old" targets due to the specific 
and substantial increase in potency of such desired compounds when tested on the 
sensitized cells of the current invention. Second, the methods used to sensitize cells to 
compounds acting at a target of interest may also sensitize these cells to compounds 

5 acting at other target molecules within the same biological pathway. For example, 
expression of an antisense molecule to a gene encoding a ribosomal protein is expected 
to sensitize the cell to compounds acting at that ribosomal protein and may also 
sensitize the cells to compounds acting at any of the ribosomal components (proteins or 
rRNA) or even to compounds acting at any target which is part of the protein synthesis 

10 pathway. Thus an important advantage of the present invention is the ability to reveal 
new targets and pathways that were previously not readily accessible to drug discovery 
methods. 

Sensitized cells of the present invention are prepared by reducing the activity or 
level of a target molecule. The target molecule may be a gene product, such as an RNA 

15 or polypeptide produced from the proliferation-required nucleic acids described herein. 

Alternatively, the target may be a gene product such as an RNA or polypeptide which is 
produced form a sequence within the same operon as the proliferation-required nucleic 
acids described herein. In addition, the target may be an RNA or polypeptide in the 
same biological pathway as the proliferation-required nucleic acids described herein. 

20 Such biological pathways include, but are not limited to, enzymatic, biochemical and 
metabolic pathways as well as pathways involved in the production of cellular 
structures such the cell wall. 

Current methods employed in the arts of medicinal and combinatorial 
chemistries are able to make use of structure-activity relationship information derived 

25 from testing compounds in various biological assays including direct binding assays and 

cell-based assays. Occasionally compounds are directly identified in such assays that 
are sufficiently potent to be developed as drugs. More often, initial hit compounds 
exhibit moderate or low potency. Once a hit compound is identified with low or 
moderate potency, directed libraries of compounds are synthesized and tested in order 

30 to identify more potent leads. Generally these directed libraries are combinatorial 
chemical libraries consisting of compounds with structures related to the hit compound 
but containing systematic variations including additions, subtractions and substitutions 
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of various structural features. When tested for activity against the target molecule, 
structural features are identified that either alone or in combination with other features 
enhance or reduce activity. This information is used to design subsequent directed 
libraries containing compounds with enhanced activity against the target molecule. 
5 After one or several iterations of this process, compounds with substantially increased 

activity against the target molecule are identified and may be further developed as 
drugs. This process is facilitated by use of the sensitized cells of the present invention 
since compounds acting at the selected targets exhibit increased potency in such cell- 
based assays, thus; more compounds can now be characterized providing more useful 

1 0 information than would be obtained otherwise. 

Thus, it is now possible using cell-based assays of the present invention to 
identify or characterize compounds that previously would not have been readily 
identified or characterized including compounds that act at targets that previously were 
not readily exploited using cell-based assays. The process of evolving potent drug leads 

15 from initial hit compounds is also substantially improved by the cell-based assays of the 
present invention because, for the same number of test compounds, more structure- 
function relationship information is likely to be revealed. 

The method of sensitizing a cell entails selecting a suitable gene or operon. A 
suitable gene or operon is one whose expression is required for the proliferation of the 

20 cell to be sensitized. The next step is to introduce into the cells to be sensitized, an 
antisense RNA capable of hybridizing to the suitable gene or operon or to the RNA 
encoded by the suitable gene or operon. Introduction of the antisense RNA can be in 
the form of an expression vector in which antisense RNA is produced under the control 
of an inducible promoter. The amount of antisense RNA produced is limited by varying 

25 the inducer concentration to which the cell is exposed and thereby varying the activity 
of the promoter driving transcription of the antisense RNA. Thus, cells are sensitized 
by exposing them to an inducer concentration that results in a sub-lethal level of 
antisense RNA expression. 

In one embodiment of the cell-based assays, the identified exogenous E. coli 

30 nucleotide sequences of the present invention are used to inhibit the production of a 

proliferation-required protein. Expression vectors producing antisense RNA against 
identified genes required for proliferation are used to limit the concentration of a 
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proliferation-required protein without severly inhibiting growth. To achieve that goal, a 
growth inhibition dose curve of inducer is calculated by plotting various doses of 
inducer against the corresponding growth inhibition caused by the antisense expression. 
From this curve, various percentages of antisense induced growth inhibition, from 1 to 
100% can be determined. If the promoter contained in the expression vector contains a 
lac operator the transcription is regulated by lac repressor and expression from the 
promoer is inducible with IPTG. For example, the highest concentration of the inducer 
IPTG that does not reduce the growth rate (0% growth inhibition) can be predicted from 
the curve. Cellular proliferation can be monitored by growth medium turbidity via OD 
measurements. In another example, the concentration of inducer that reduces growth by 
25% can be predicted from the curve. In still another example, a concentration of 
inducer that reduces growth by 50% can be calculated. Additional parameters such as 
colony forming units (cfu) can be used to measure cellular viability. 

Cells to be assayed are exposed to the above-determined concentrations of 
inducer. The presence of the inducer at this sub-lethal concentration reduces the 
amount of the proliferation required gene product to the lowest amount in the cell that 
will support growth. Cells grown in the presence of this concentration of inducer are 
therefore specifically more sensitive to inhibitors of the proliferation-required protein or 
RNA of interest or to inhibitors of proteins or RNAs in the same biological pathway as 
the proliferation-required protein or RNA of interest but not to inhibitors of unrelated 
proteins or RNAs. 

Cells pretreated with sub-inhibitory concentrations of inducer and thus containing 
a reduced amount of proliferation-required target gene product are then used to screen 
for compounds that reduce cell growth. The sub-lethal concentration of inducer may be 
any concentration consistent with the intended use of the assay to identify candidate 
compounds to which the cells are more sensitive. For example, the sub-lethal 
concentration of the inducer may be such that growth inhibition is at least about 5%, at 
least about 8%, at least about 10%, at least about 20%, at least about 30%, at least about 
40%, at least about 50%, at least about 60% at least about 75%, or more. Cells which 
are pre-sensitized using the preceding method are more sensitive to inhibitors of the 
target protein because these cells contain less target protein to inhibit than wild-type 
cells. 
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In another embodiment of the cell based assays of the present invention, the level or 
activity of a proUferation required gene product is reduced using a temperature sensitive 
...mutation in the proliferation-required sequence and an antisense nucleic acid against 
the proliferation-required sequence. Growing the cells at an intermediate temperature 
between the permissive and restrictive temperatures of the temperature sensitive mutant 
where the mutation is in a proliferation-required gene produces cells with reduced 
activity of the proliferation-required gene product. The antisense RNA directed against 
the proliferation-required sequence further reduces the activity of the proliferation 
required gene product. Drugs that may not have been found using either the 
temperature sensitive mutation or the antisense nucleic acid alone may be identified by 
determining whether cells in which expression of the antisense nucleic acid has been 
induced and which are grown at a temperature between the permissive temperature and 
the restrictive temperature are substantially more sensitive to a test compound than cells 
in which expression of the antisense nucleic acid has not been induced and which are 
grown at a permissive temperature. Also drugs found previously from either the 
antisense nucleic acid alone or the temperature sensitive mutation alone may have a 
different sensitivity profile when used in cells combining the two approaches, and that 
sensitivity profile may indicate a more specific action of the drug in inhibiting one or 
more activities of the gene product. 

Temperature sensitive mutations may be located at different sites within the 
gene and correspond to different domains of the protem. For example, the dnaB gene of 
Escherichia coli encodes the replication fork DNA helicase. DnaB has several 
domains, including domains for oligomerization, ATP hydrolysis, DNA binding, 
interaction with primase, interaction with DnaC, and interaction with DnaA [(Biswas, 
E.E. and Biswas, S.B. 1999. Mechanism and DnaB helicase of Escherichia coli: 
structural domains involved in ATP hydrolysis, DNA binding, and oligomerization. 
Biochem. 38:10919-10928; Hiasa, H. and Marians, K.J. 1999. Initiation of bidirectional 
replication at the chromosomal origin is directed by the interaction between helicase 
and primase. J. Biol. Chem. 274:27244-27248; San Martin, C, Radermacher, M., 
Wolpensinger, B., Engel, A., Miles, C.S., Dixon, N.E., and Carazo, J.M. 1998. Three- 
dimensional reconstructions fi:om cryoelectron microscopy images reveal an intimate 
complex between heUcase DnaB and its loading partner DnaC. Structure 6:501-9; 
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Sutton, M.D., Can, K.M., Vicente, M., and Kaguni, J.M. 1998. Escherichia coli DnaA 
protein. The N-terminal domain and loading of DnaB helicase at the E. coli 
chromosomal. J. Biol. Chem. 273:34255-62.), the disclosures of which are incorporated 
herein by reference in their entireties]. Temperature sensitive mutations in different 
domains of DnaB confer different phenotypes at the restrictive temperature, which 
include either an abrupt stop or slow stop in DNA replication with or without DNA 
breakdown (Wechsler, J. A. and Gross, J.D. 1971. Escherichia coli mutants temperature- 
sensitive for DNA synthesis. Mol. Gen. Genetics 113:273-284, the disclosure of which 
is incorporated herein by reference in its entnety) and termination of growth or cell 
death. Combining the use of temperature sensitive mutations in the dnaB gene that 
cause cell death at the restrictive temperature with an antisense to the dnaB gene could 
lead to the discovery of very specific and effective inhibitors of one or a subset of 
activities exhibited by DnaB. 

When screening for antimicrobial agents against a gene product required for 
proliferation, growth inhibition of cells containing a limiting amount of that 
proliferation-required gene product can be assayed. Growth inhibition can be measured 
by directly comparing the amount of growth, measured by the optical density of the 
growth medium, between an experimental sample and a control sample. Alternative 
methods for assaying cell proliferation include measuring green fluorescent protein 
(GFP) reporter construct emissions, various enzymatic activity assays, and other 
methods well known in the art. 

It will be appreciated that the above method may be performed in solid phase, 
liquid phase or a combination of the two. For example, cells grown on nutrient agar 
containing the inducer of the antisense construct may be exposed to compounds spotted 
onto the agar surface. A compound's effect may be judged from the diameter of the 
resulting killing zone, the area around the compound application point in which cells do 
not grow. Multiple compounds may be transferred to agar plates and simultaneously 
tested using automated and semi-automated equipment including but not restricted to 
multi-channel pipettes (for example the Beckman Multimek) and multi-channel spotters 
(for example the Genomic Solutions Flexys). In this way multiple plates and thousands 
to millions of compounds may be tested per day. 
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The compounds may also be tested entirely in liquid phase using microliter 
plates as described below. Liquid phase screening may be performed in microtiter 
plates containing 96, 384, 1536 or more wells per microtiter plate to screen multiple 
plates and thousands to millions of compoimds per day. Automated and semi- 
5 automated equipment may be used for addition of reagents (for example cells and 

compounds) and determination of cell density. 

EXAMPLE 9 

The effectiveness of the above cell based assay was validated using constructs 
expressing antisense RNA to E. coli genes rplL, rplJ, and rplW encoding ribosomal 

10 proteins L7/L12, LIO and L23 respectively. These proteins are part of the protein 
synthesis apparatus of the cell and as such are required for proliferation. These 
constructs were used to test the effect of antisense expression on cell sensitivity to 
antibiotics known to bind to the ribosome and thereby inhibit protein synthesis. 
Constructs expressing antisense RNA to several other genes (elaD, visC, yohH, and 

15 aptE/B), the products of which are not involved in protein synthesis were used for 

comparison. 

First expression vectors containing antisense constructs to either rplW or to elaD 
were introduced into separate E. coli cell populations. Vector introduction is a 
technique well knovm to those of ordinary skill in the art. The expression vectors of 

20 this example contain IPTG inducible promoters that drive the expression of the 

antisense RNA in the presence of the inducer. However, those skilled in the art will 
appreciate that other inducible promoters may also be used. Suitable expression vectors 
are also well known in the art. The E. coli antisense clones encoding ribosomal proteins 
L7/L12, LIO and L23 were used to test the effect of antisense expression on cell 

25 sensitivity to the antibiotics known to bind to these proteins. First, expression vectors 
containing antisense to either the genes encoding L7/L12 and LIO or L23 were 
introduced into separate E. coli cell populations. 

The cell populations were exposed to a range of IPTG concentrations in liquid 
medium to obtain the growth inhibitory dose curve for each clone (Fig. 1). First, seed 

30 cultures were grown to a particular turbidity that is measured by the optical density 
(OD) of the grov^h solution. The OD of the solution is directly related to the number of 
bacterial cells contained therein. Subsequently, sixteen 200 ul liquid medium cultures 
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were grown in a 96 well microtiter plate at 37 C with a range of IPTG concentrations in 
duplicate two-fold serial dilutions from 1600 uM to 12.5 uM (final concentration). 
Additionally, control cells were grown in duplicate without IPTG. These cultures were 
started from equal amounts of cells derived from the same initial seed culture of a clone 
5 of interest. The cells were grown for up to 15 hours and the extent of growth was 

determined by measuring the optical density of the cultures at 600 nm. When the 
control culture reached mid-log phase the percent growth of the control for each of the 
IPTG containing cultures was plotted against the log concentrations of IPTG to produce 
a growth inhibitory dose response curve for the IPTG. The concentration of IPTG that 

10 inhibits cell growth to 50% (IC50) as compared to the 0 mM IPTG control (0% growth 
inhibition) was then calculated from the curve. Under these conditions, an amount of 
antisense RNA was produced that reduced the expression levels of rplW and elaD to a 
degree such that growth was inhibited by 50%, 

Alternative methods of measuring growth are also contemplated. Examples of 

15 these methods include measurements of proteins, the expression of which is engineered 

into the cells being tested and can readily be measured. Examples of such proteins 
include green fluorescent protein (GFP) and various enzymes. 

Cells were pretreated with the selected concentration of IPTG and then used to 
test the sensitivity of cell populations to tetracycline, erythromycin and other protein 

20 synthesis inhibitors. An example of a tetracycline dose response curve is shown in 
Figures 2A and 2B for the rplW and elaD genes, respectively. Cells were grown to log 
phase and then diluted into media alone or media containing IPTG at concentrations 
which give 20% and 50% growth inhibition as determined by IPTG dose response 
curves. After 2.5 hours, the cells were diluted to a final OD600 of 0,002 into 96 well 

25 plates containing (1) +/- IPTG at the same concentrations used for the 2.5 hour pre- 
incubation; and (2) serial two-fold dilutions of tetracycline such that the final 
concentrations of tetracycline range from 1 [xg/ml to 15.6 ng/ml and 0 [ig/ml. The 96 
well plates were incubated at 37°C and the OD600 was read by a plate reader every 5 
minutes for up to 15 hours. For each IPTG concentration and the no IPTG control, 

30 tetracycline dose response curves were determined when the control (absence of 
tetracycline) reached 0.1 OD600. To compare tetracycline sensitivity with and without 
IPTG, tetracycline IC50s were determined from the dose response curves (Figs. 2A-B). 
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Cells with reduced levels of L23 (rplW) showed increased sensitivity to tetracycline 
(Fig. 2A) as compared to cells with reduced levels of elaD (Fig. 2B). Figure 3 shows a 
summary bar chart in which the ratios of tetracycline IC50s determined in the presence 
of IPTG which gives 50% growth inhibition versus tetracycline IC50s determined 
without IPTG (fold increase in tetracycline sensitivity) were plotted. Cells with reduced 
levels of either L7/L12 (genes rplL, rplJ) or L23 {rplW) showed increased sensitivity to 
tetracycline (Fig. 3). Cells expressing antisense to genes not known to be involved in 
protein synthesis {atpB/E, visC, elaD, yohH) did not show the same increased 
sensitivity to tetracycline, validating the specificity of this assay (Fig. 3). 

In addition to the above, it has been observed in initial experiments that clones 
expressing antisense RNA to genes involved in protein synthesis (including genes 
encoding ribosomal proteins L7/L12 & LIO, L7/L12 alone, L22, and LI 8, as well as 
genes encoding rRNA and Elongation Factor G) have increased sensitivity to the 
macrolide, erythromycin, whereas clones expressing antisense to the non-protein 
synthesis genes elaD, atpB/E and visC do not. Furthermore, the clone expressing 
antisense to rplL and rplJ does not show increased sensitivity to nalidixic acid and 
ofloxacin, antibiotics which do not inhibit protein synthesis. 

The results with the ribosomal protein genes rplL, rplJ, and rplW as well as the 
initial results using various other antisense clones and antibiotics show that limiting the 
concentration of an antibiotic target makes cells more sensitive to the antimicrobial 
agents that specifically interact with that protein. The results also show that these cells 
are sensitized to antimicrobial agents that inhibit the overall function in which the 
protein target is involved but are not sensitized to antimicrobial agents that inhibit other 
functions. 

The cell based assay described above may also be used to identify the biological 
pathway in which a proliferation-required nucleic acid or its gene product lies. In such 
methods, cells expressing a sub-lethal level of antisense to a target proliferation- 
required nucleic acid and control cells in which expression of the antisense has not been 
induced are contacted with a panel of antibiotics known to act in various pathways. If 
the antibiotic acts in the pathway in which the target proliferation-required nucleic acid 
or its gene product lies, cells in which expression of the antisense has been induced will 
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be more sensitive to the antibiotic than cells in which expression of the antisense has 
not been induced. 

As a control, the results of the assay may be confirmed by contacting a panel of 
cells expressing antisense nucleic acids to many different proliferation-required genes 
5 including the target proliferation-required gene. If the antibiotic is acting specifically, 
heightened sensitivity to the antibiotic will be observed only in the cells expressing 
antisense to a target proliferation-required gene (or cells expressing antisense to other 
proliferation-required genes in the same pathway as the target proliferation-required 
gene) but will not be observed generally in all cells expressing antisense to 
10 proliferation-required genes. 

Similarly, the above method may be used to determine the pathway on which a 
test antibiotic acts. A panel of cells, each of which expresses antisense to a 
proliferation-required nucleic acid in a known pathway, is contacted with a compound 
for which it is desired to determine the pathway on which it acts. The sensitivity of the 
15 panel of cells to the test compound is determined in cells in which expression of the 
antisense has been induced and in control cells in which expression of the antisense has 
not been induced. If the test antibiotic acts on the pathway on which an antisense 
nucleic acid acts, cells in which expression of the antisense has been induced will be 
more sensitive to the antibiotic than cells in which expression of the antisense has not 
20 been induced. In addition, control cells in which expression of antisense to 
proliferation-required genes in other pathways has been induced will not exhibit 
heightened sensitivity to the antibiotic. In this way, the pathway on which the test 
antibiotic acts may be determined. 

The Example below provides one method for performing such assays. 
25 EXAMPLE 10 

Identification of thePathwav in which a Proliferation-Required 
Gene Lies or the Pathwav on which an Antibiotic Acts 
A. Preparation of Bacterial Stocks for Assay 

To provide a consistent source of cells to screen, frozen stocks of host bacteria 
30 containing the desired antisense construct are prepared using standard microbiological 
techniques. For example, a single clone of the organism can be isolated by streaking out 
a sample of the original stock onto an agar plate containing nutrients for cell growth and 
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an antibiotic for which the antisense construct contains a gene which confers resistance. 
After overnight growth an isolated colony is picked from the plate with a sterile needle 
and transferred to an appropriate liquid growth media containing the antibiotic required 
for maintenance of the plasmid. The cells are incubated at 30°C to 37°C with vigorous 
shaking for 4 to 6 hours to yield a culture in exponential growth. Sterile glycerol is 
added to 15% (volume to volume) and lOOfaL to 500 |aL aliquots are distributed into 
sterile cryotubes, snap frozen in liquid nitrogen, and stored at -80°C for fliture assays. 

B. Growth of Bacteria for Use in the Assav 

A day prior to an assay, a stock vial is removed from the freezer, rapidly thawed 
(37°C water bath) and a loop of culture is streaked out on an agar plate containing 
nutrients for cell growlh and an antibiotic to which the antisense construct confers 
resistance. After overnight growth at 37°C, ten randomly chosen, isolated colonies are 
fransferred from the plate (sterile moculum loop) to a sterile tube containing 5 mL of 
LB medium containing the antibiotic to which the antisense vector confers resistance. 
After vigorous mixing to form a homogeneous cell suspension, the optical density of the 
suspension is measured at 600 nm (OD600) and if necessary an aliquot of the 
suspension is diluted into a second tube of 5 mL, sterile, LB medium plus antibiotic to 
achieve an OD600 < 0.02 absorbance units. The culture is then incubated at 37° C for 
1-2 hrs with shaking until the OD600 reaches OD 0.2 - 0.3. At this point the cells are 
ready to be used in the assay. 

C. Selection of Media to be Used in Assay 

Two fold dilution series of the inducer are generated in culture media containing 
the appropriate antibiotic for maintenance of the antisense construct. Several media are 
tested side by side and three to four wells are used to evaluate the effects of the inducer 
at each concentration in each media. For example, M9 minimal media, LB broth, TBD 
broth and MuUer-Hinton media may be tested with the inducer IPTG at the following 
concentrations, 50 |aM, 100 ^iM, 200 ^iM, 400 piM, 600 |aM, 800 and 1000 ^iM. 
Equal volumes of test media-inducer and cells are added to the wells of a 384 well 
microtiter plate and mixed. The cells are prepared as described above and diluted 1 :100 
in the appropriate media containing the test antibiotic immediately prior to addition to 
the microtiter plate wells. For a control, cells are also added to several wells of each 
media that do not contain inducer, for example 0 \iM IPTG. Cell growth is monitored 
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continuously by incubation at 37°C in a microtiter plate reader monitoring the OD600 
of the wells over an 18-hour period. The percent inhibition of growth produced by each 
concentration of inducer is calculated by comparing the rates of logarithmic growth 
against that exhibited by cells growing in media without inducer. The medium yielding 
greatest sensitivity to inducer is selected for use in the assays described below. 

Measurement of Test Antibiotic Sensitivity in the Absence of Antisense Construct 
Induction 

Two-fold dilution series of antibiotics of known mechanism of action are 
generated in the culture media selected for further assay development that has been 
supplemented with the antibiotic used to maintain the construct. A panel of test 
antibiotics known to act on different pathways is tested side by side with three to four 
wells being used to evaluate the effect of a test antibiotic on cell growth at each 
concentration. Equal volumes of test antibiotic and cells are added to the wells of a 384 
well microtiter plate and mixed. Cells are prepared as described above using the media 
selected for assay development supplemented with the antibiotic required to maintain 
the antisense construct and are diluted 1:100 in identical media immediately prior to 
addition to the microtiter plate wells. For a control, cells are also added to several wells 
that contain the solvent used to dissolve the antibiotics but no antibiotic. Cell growth is 
monitored continuously by incubation at 37°C in a microtiter plate reader monitoring 
the OD600 of the wells over an 18-hour period. The percent inhibition of growth 
produced by each concentration of antibiotic is calculated by comparing the rates of 
logarithmic growth against that exhibited by cells growing in media without antibiotic. 
A plot of percent inhibition against log[antibiotic concentration] allows extrapolation of 
an IC50 value for each antibiotic. 

E- Measurement of Test A ntibiotic Sensitivitv in the Presence of Antisense Construct 
Inducer 

The culture media selected for use in the assay is supplemented with inducer at 
concentrations shown to inhibit cell growth by 50 and 80% as described above and the 
antibiotic used to maintain the construct. Two fold dilution series of the panel of test 
antibiotics used above are generated in each of these media. Several antibiotics are 
tested side by side with three to four wells being used to evaluate the effects of an 
antibiotic on cell growth at each concentration, in each media. Equal volumes of test 
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antibiotic and ceils are added to tiie wells of a 384 well microtiter plate and mixed. 
Cells are prepared as described above using the media selected for use in the assay 
supplemented with the antibiotic required to maintain the antisense construct. The cells 
are diluted 1:100 into two 50 mL aliquots of identical media containing concentrations 
of inducer that have been shown to inhibit cell growth by 50% and 80 % respectively 
and incubated at 37°C with shaking for 2.5 hours. Immediately prior to addition to the 
microtiter plate wells, the cultures are adjusted to an appropriate ODeoo (typically 0.002) 
by dilution into warm (37°C) sterile media supplemented with identical concentrations 
of the inducer and antibiotic used to maintain the antisense construct. For a control, 
cells are also added to several wells that contain solvent used to dissolve test antibiotics 
but which contain no antibiotic. Cell growth is monitored continuously by incubation at 
37°C in a microtiter plate reader monitoring the OD600 of the wells over an 18-hour 
period. The percent inhibition of growth produced by each concentration of antibiotic is 
calculated by comparing the rates of logarithmic growth against that exhibited by cells 
growing in media without antibiotic. A plot of percent inhibition against Iog[antibiotic 
concentration] allows extrapolation of an IC50 value for each antibiotic. 
F- Determinmg the Specificity of the Test Antibiotics 

A comparison of the IC50S generated by antibiotics of known mechanism of 
action under antisense induced and non-induced conditions allows the pathway in which 
a proliferation-required nucleic acid lies to be identified. If cells expressing an 
antisense nucleic acid against a proliferation-required gene are selectively sensitive to 
an antibiotic acting via a particular pathway, then the gene against which the antisense 
acts is involved in the pathway in which the antibiotic acts. 
G- Identification of Pathway in which a Test Antibiotic Acts 

As discussed above, the cell based assay may also be used to determine the 
pathway against which a test antibiotic acts. In such an analysis, the pathways against 
which each member of a panel of antisense nucleic acids acts are identified as described 
above. A panel of cells, each containing an inducible antisense vector against a gene in 
a known proliferation-required pathway, is contacted with a test antibiotic for which it 
is desired to determine the pathway on which it acts under inducing an non-inducing 
conditions. If heightened sensitivity is observed in induced cells expressing antisense 
against a gene in a particular pathway but not in induced cells expressing antisense 



-82- 



against genes in other pathways, then the test antibiotic acts against the pathway for 
which heightened sensitivity was observed. 

One skilled in the art will appreciate that further optimization of the assay 
conditions, such as the concentration of inducer used to induce antisense expression 
5 and/or the growth conditions used for the assay (for example incubation temperature 

and media components) may further increase the selectivity and/or magnitude of the 
antibiotic sensitization exhibited. 

The following example confirms the effectiveness of the methods described 

above. 

10 EXAMPLE 11 

Identification of the Pathway in which a Proliferation-Required Gene Lies 
Antibiotics of various chemical classes and modes of action were purchased 
from Sigma Chemicals (St. Louis, MO). Stock solutions were prepared by dissolving 
each antibiotic in an appropriate aqueous solution based on information provided by the 

15 manufacturer. The final working solution of each antibiotic contained no more than 

0.2% (w/v) of any organic solvent. To determine their potency against a bacterial strain 
engineered for expression of an antisense against a proliferation-required SOS ribosomal 
protein, each antibiotic was serially diluted two or three fold in growth medium 
supplemented with the appropriate antibiotic for maintenance of the anti-sense 

20 construct. At least ten dilutions were prepared for each antibiotic. 25 ^L aliquots of 
each dilution were transferred to discrete wells of a 384-well microplate (the assay 
plate) using a multi-channel pipette. Quadruplicate wells were used for each dilution of 
an antibiotic under each treatment condition (plus and minus inducer). Each assay plate 
contained twenty wells for cell growth controls (growth media replacing antibiotic), ten 

25 wells for each treatment (plus and minus inducer, in this example IPTG). Assay plates 

were usually divided into the two treatments: half the plate containing induced cells and 
an appropriate concentrations of inducer (in this example IPTG) to maintain the state of 
induction, the other half containing non-induced cells in the absence of IPTG. 

Cells for the assay were prepared as follows. Bacterial cells containing a 

30 construct, fi-om which expression of antisense nucleic acid against rplL and rplJ, which 

encode proliferation-required SOS ribosomal subunit proteins, is inducible in the 
presence of IPTG, were grown into exponential growth (OD600 0.2 to 0.3) and then 
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diluted 1:100 into fresh media containing either 400 [xM or 0 j^M inducer (IPTG). These 
cultures were incubated at 37° C for 2.5 hr. After a 2.5 hi incubation, induced and non- 
induced cells were respectively diluted into an assay medium at a final ODeoo value of 
0.0004. The medium contained an appropriate concentration of the antibiotic for the 
maintenance of the anti-sense construct. In addition, the medium used to dilute induced 
cells was supplemented with 800 |iM IPTG so that addition to the assay plate would 
result in a final IPTG concentration of 400 jiM. Induced and non-induced cell 
suspensions were dispensed (25 iil/well) into the appropriate wells of the assay plate as 
discussed previously. The plate was then loaded into a plate reader, incubated at 
constant temperature, and cell growth was monitored in each well by the measurement 
of light scattering at 595 nm. Growth was monitored every 5 minutes until the cell 
culture attained a stationary growth phase. For each concentration of antibiotic, a 
percentage inhibition of growth was calculated at the time point corresponding to mid- 
exponential growth for the associated control wells (no antibiotic, plus or minus IPTG). 
For each antibiotic and condition (plus or minus IPTG), a plot of percent inhibition 
versus log of antibiotic concentration was generated and the IC50 determined. A 
comparison of the IC50 for each antibiotic in the presence and absence of IPTG revealed 
whether induction of the antisense construct sensitized the cell to the mechanism of 
action exhibited by the antibiotic. Cells which exhibited a significant (standard 
statistical analysis) numerical decrease in the IC50 value in the presence of inducer were 
considered to have an increased sensitivity to the test antibiotic. 

The results are provided in the table below, which lists the classes and names of 
the antibiotics used in the analysis, the targets of the antibiotics, the IC50 in the absence 
of IPTG, the IC50 in the presence of IPTG, the concentration units for the IC50s, the 
fold increase in IC50 in the presence of IPTG, and whether increased sensitivity was 
observed in the presence of IPTG. 
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The above results demonstrate that induction of an antisense RNA to genes 
encodmg 508 ribosomal subunit proteins results in a selective and highly significant 
sensitization of cells to antibiotics that inhibit ribosomal function and protein synthesis. 
The above results further demonstrate that induction of an antisense construct to an 
essential gene sensitizes an organism to compounds that interfere with that gene 
products' biological role. This sensitization is restricted to compounds that interfere 
with pathways associated with the targeted gene and it's product. 

Assays utilizing antisense constructs to essential genes can be used to identify 
compounds that specifically interfere with the activity of multiple targets in a pathway. 
Such constructs can be used to simultaneously screen a sample against multiple targets 
in one pathway in one reaction (Combinatorial HTS). 

Furthermore, as discussed above, panels of antisense construct containing cells 
may be used to characterize the point of intervention of any compound affecting an 
essential biological pathway including antibiotics with no known mechanism of action. 

Another embodiment of the present invention is a method for determining the 
pathway against which a test antibiotic compound is active in which the activity of 
target proteins or nucleic acids involved in proliferation-required pathways is reduced 
by contacting cells with a sublethal concentration of a known antibiotic which acts 
against the target protein or nucleic acid. In one embodiment, the target protein or 
nucleic acid is a target protem or nucleic acid corresponding to a proliferation-required 
nucleic acid identified using the methods described above. The method is similar to 
those described above for determining which pathway a test antibiotic acts against 
except that rather than reducing the activity or level of a proliferation-required gene 
product using a sublethal level of antisense to a proliferation-required nucleic acid, the 
activity or level of the proliferation-required gene product is reduced using sublethal 
level of a known antibiotic which acts against the proliferation required gene product. 

Interactions between drugs which affect the same biological pathway has been 
described in the literature. For example, Mecillinam (Amdinocillin) binds to and 
inactivates the penicillin binding protein 2 (PBP2, product of the mrdA in E. coli). This 
antibiotic inteacts with other antibiotics that inhibit PBP2 as well as antibiotics that 
inhibit other penicillin binding proteins such as PBP3 [(Gutmann, L., Vincent, S., 
Billot-Klein, D., Acar, J.F., Mrena, E., and WiUiamson, R. (1986) Involvement of 
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penicillin-binding protein 2 with other penicillin-binding proteins in lysis of 
Escherichia coli by some beta-lactam antibiotics alone and in synergistic lytic effect of 
amdinocillin (mecillinam). Antimicrobial Agents & Chemotherapy, 30:906-912), the 
disclosure of which is incorporated herein by reference in its entirety]. Interactions 
5 between drugs could, therefore, involve two drugs that inhibit the same target protein or 

nucleic acid or inhibit different proteins or nucleic acids in the same pathway 
[(Fukuoka, T., Domon, H,, Kakuta, M., Ishii, C, Hirasawa, A., Utsui, Y., Ohya, S., and 
Yasuda, H. (1997) Combination effect between panipenem and vancomycin on highly 
methicillin-resistant Staphylococcus aureus, Japan. J. Antibio. 50:411-419; Smith, C.E., 

10 Foleno, B.E., Barrett, J.F., and Frosc, M.B. (1997) Assessment of the synergistic 

interactions of levofloxacin and ampicillin against Enterococcus faecium by the 
checkerboard agar dilution and time-kill methods. Diagnos. Microbiol Infect. Disease 
27:85-92; den Hollander, J.G., Horrevorts, A.M., van Goor, M.L,, Verbrugh, H.A., and 
Mouton, J.W. (1997) Synergism between tobramycin and ceftazidime against a resistant 

15 Pseudomonas aeruginosa strain, tested in an in vitro pharmacokinetic model. 

Antimicrobial Agents & Chemotherapy. 41:95-110), the disclosure of all of which are 
incorporated herein by reference in their entireties]. 

Two drugs may interact even though they inhibit different targets. For example, 
the proton pump inhibitor. Omeprazole, and the antibiotic, Amoxycillin, two synergistic 

20 compounds acting together, can cure Helicobacter pylori infection [( Gabryelewicz, A., 

Laszewicz, W., Dzieniszewski, J., Ciok, J., Marlicz, K., Bielecki, D., Popiela, T., 
Legutko, J., Knapik, Z., Poniewierka, E. (1997) Multicenter evaluation of dual-therapy 
(omeprazol and amoxycillin) for Helicobacter /?y/on-associated duodenal and gastric 
ulcer (two years of the observation). J. Physiol Pharmacol. 48 Suppl 4:93-105), the 

25 disclosure of which is incorporated herein by reference in its entirety]. 

The growth inhibition from the sublethal concentration of the known antibiotic 
may be at least about 5%, at least about 8%, at least about 10%, at least about 20%, at 
least about 30%, at least about 40%, at least about 50%, at least about 60%, or at least 
about 75%, or more. 

30 Alternatively, the sublethal concentration of the known antibiotic may be 

determined by measuring the activity of the target proliferation-required gene product 
rather than by measuring growth inhibition. 
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Cells are contacted with a combination of each member of a panel of known 
antibiotics at a sublethal level and varying concentrations of the test antibiotic. As a 
control, the cells are contacted with varying concentrations of the test antibiotic alone. 
The IC50 of the test antibiotic in the presence and absence of the known antibiotic is 
5 determined. If the IC50s in the presence and absence of the known drug are 

substantially similar, then the test drug and the known drug act on different pathways. 
If the IC50S are substantially different, then the test drug and the known drug act on the 
same pathway. 

Another embodiment of the present invention is a method for identifying a 
10 candidate compound for use as an antibiotic in which the activity of target proteins or 
nucleic acids involved in proliferation-required pathways is reduced by contacting cells 
with a sublethal concentration of a known antibiotic which acts against the target 
protein or nucleic acid. In one embodiment, the target protein or nucleic acid is a target 
protein or nucleic acid corresponding to a proliferation-required nucleic acid identified 
15 using the methods described above. The method is similar to those described above for 

identifying candidate compounds for use as antibiotics except that rather than reducing 
the activity or level of a proliferation-required gene product using a sublethal level of 
antisense to a proliferation-required nucleic acid, the activity or level of the 
proliferation-required gene product is reduced using a sublethal level of a known 
20 antibiotic which acts against the proliferation required gene product. 

The growth inhibition from the sublethal concentration of the known antibiotic 
may be at least about 5%, at least about 8%, at least about 10%, at least about 20%, at 
least about 30%, at least about 40%, at least about 50%, at least about 60%, or at least 
about 75%, or more. 

25 Alternatively, the sublethal concentration of the known antibiotic may be 

determined by measuring the activity of the target proliferation-required gene product 
rather than by measuring grovrth inhibition. 

In order to characterize test compounds of interest, cells are contacted with a 
panel of known antibiotics at a sublethal level and one or more concentrations of the 

30 test compound. As a control, the cells are contacted with the same concentrations of the 
test compound alone. The IC50 of the test compound in the presence and absence of the 
known antibiotic is determined. If the IC50 of the test compound is substantially 
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different in the presence and absence of the known drug then the test compound is a 
good candidate for use as an antibiotic. As discussed above, once a candidate 
compound is identified using the above methods its structure may be optimized using 
standard techniques such as combinatorial chemistry. 

Representative known antibiotics which may be used in each of the above 
methods are provided in the table below. However, it will be appreciated that other 
antibiotics may also be used. 



ANTIBIOTIC 


INHIBITS/TARGET 


RESISTANT 






MUTANTS 



Inhibitors of Transcription 

Rifamycin, 1959 Rifampicin 

Rifabutin Rifaximin 
Streptolydigin 

Streptovaricin 

Actinomycin D+EDTA 



Inhibits initiation of transcription/B-subunit 
RNA polymerase, rpoB 
Accelerates transcription chain 
termination/fi-subunit RNA polymerase 
an acyclic ansamycin, inhibits RNA 
polymerase 

Intercalates between 2 successive G-C 
pairs, rpoB, inhibits RNA synthesis 



Inhibitors of Nucleic Acid Metabolism 

Quinolones, 1 962 Nalidixic a subunit gyrase and/or topoisomerase IV, 

gyrA 



acid Oxolinic acid 



Fluoroquinolones 
Ciprofloxacin, 1983 
Norfloxacin 
Coumerins Novobiocin 



Coumermycin 



Albicidin 
Metronidazole 



a subunit gyrase, gyrA and/or 
topoisomerase IV (probable target in Staph) 

Inhibits ATPase activity of B-subunit 
gyrase, gyrB 

Inhibits ATPase activity of 6-subunit 
gyrase, gyrB 
DNA synthesis 

Causes single-strand breaks in DNA 



Inhibitors of Metabolic Pathways 

Sulfonamides, 1932 blocks synthesis of dihydrofolate,dihydro- 

pteroate synthesis, /o//^ 
Inhibits dihydrofolate reductase, 
Nucleoside analogue capable of alkylating 
sulfhydryl groups, inhibitor of thymidylate 
synthetase 

type II fatty acid synthase inhibitor 



Sulfanilamide 
Trimethoprim, 1962 
Shov^domycin 



Thiolactomycin 

Psicofuranine 
Triclosan 

Diazoborines Isoniazid, 
Ethionamide 



Adenosine glycoside antibiotic, target is 
GMP synthetase 
Inhibits fatty acid synthesis 
heterocyclic, contains boron, inhibit fatty 
acid synthesis, enoyl-ACP reductase, /aZ)/ 



rpoB, crp, cyaA 

rpoB 

rpoB 

pldA 

gyrA or B, icd, sloB 
gyrA 

norA (efflux in Staph) 
hipQ 

gyrB, cysB, cysE, nov, 

ompA 
gyrB, hisW 

tsx (nucleoside channel) 
nar 



folP, gpt, pabA, pabB, 
pabC 
foiA, thyA 
nupQ pnp 



emrB 

fadB, emrB due to gene 

dosage 
giiaA.B 



fabi (envM) 
fabi (envM) 
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Inhibitors of Translation 

Phenylpropanoids 

Chloramphenicol, 1947 



Tetracyclines, 1948, type II 

polyketides 

Minocycline 

Doxycycline 
Macrolides {type I polyketides) 

Erythromycin, 1950 

Carbomycin, Spiramycin 

etc 

Aminoglycosides Streptomycin, 
1944 

Neomycin 

Spectinomycin 
Kanamycin 

Kasugamycin 

Gentamicin, 1963 

Amikacin 

Paromycin 
Lincosamides 

Lincomycin, 1955 

Clindamycin 
Streptogramins Virginiamycin, 

1955 Pristinamycin 
Synercid: quinupristin 

/dalfopristin 
Fusidanes 

Fusidic Acid 
Kirromycin (Mocimycin) 

Pulvomycin 
Thiopeptin 

Tiamulin 
Negamycin 

Oxazolidinones Linezolid 
Isoniazid 

Nitrofurantoin 



Pseudomonic Acids Mupu*ocin 
(Bactroban) 



Binds to ribosomal peptidyl transfer center 

preventing peptide translocation/ binds to rrn, cmlA, mar A, ompF, 
S6, L3, L6, L14, L16, L25, L26, L27, but ompR 
preferentially to L16 

Binding to SOS ribosomal subunit, "A" site cImA (cmr), mar, ompF 
on SOS subunit, blocks peptide elongation, 
strongest binding to S7 

Binding to 50 S ribosomal subunit, 23S 
rRNA, blocks peptide translocation, L15, 

L 12 rrn, rplC, rplD, rplV , 

mac 

Irreversible binding to SOS ribosomal 

subunit, prevents translation or causes rpsL, strQM, ubiF 

mistranslation of mRNA/1 6S rRNA atpA-E, ec/B, 

hemAQD,E,G, topA, 
rpsQD,E, rrn, spcB 
atpA-atpE, cpxA, ecfB, 

hemA,B,L, top A 
ksgA,B,Qa rplBX 

rpsl,NM,R 
rplF, ubiF 
cpxA 
rpsL 

Binding to 50 S ribosomal subunit, blocks 

peptide translocation UnB, rplN, O, rpsG 

2 components, Streptogramins A&B, bind 
to the 50S ribosomal subunit blocking 
peptide translocation and peptide bond 
formation 

Inhibition of elongation factor G (EF-G) 

prevents peptide translocation 
Inhibition of elongation factor TU (EF-Tu), 
prevents peptide bond formation 
Binds to and inhibits EF-TU 
Sulfur-containing antibiotic, inhibits protein 

synthesis,EF-G 
Inhibits protein synthesis 
Inhibits termination process of protein 

synthesis 
23SrRNA 

pdx 

Inhibits protein synthesis, nitroreductases nfnA,B 
convert nitrofurantoin to highly reactive 
electrophilic intermediates which attack 
bacterial ribosomal proteins non- 
specifically 

Inhibition of isoleucyl tRNA synthetase- ileS 
used for Staph, topical cream, nasal 
spray 



fusA 
tufA,B 

rplE 

rplC, rplD 
prjB 
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Indolmycin 
Viomycin 



Thiopeptides 

Thiostrepton 

Micrococcin 



Inhibits tryptophanyl-tRNA synthetase 



Binds to LI 1-23S KMA complex 
Inhibits GTP hydrolysis by EF-G 
Stimulates GTP hydrolysis by EF-G 



trpS 

rrmA (23 S rRNA 
methyltransferase; 
mutant has slow 
growth rate, slow 
chain elongation rate, 
and viomycin 
resistance) 



Inhibitors of Cell Walls/Membranes 



6-lactams 

Penicillin, 1929 Ampicillin 

Methicillin, 1960 



Cephalosporins, 1962 
Mecillinam (amdinocillin) 

Aztreonam (Furazlocillin) 
Bacilysin, Tetaine 
Glycopeptides Vancomycin, 1955 

Polypeptides Bacitracin 

Cyclic Iipopeptide Daptomycin, 
1980 



Cyclic polypeptides Polymixm, 

1939 
Fosfomycin, 1969 



Cycloserine 
Alafosfalin 



Inhibition of one or more cell wall 
transpeptidases, endopeptidases, and 
glycosidases (PBPs), of the 12 PBPs only 2 
are essential: mrdA (PBP2) mdftsl (pbpB, 
PBPS) 



Binds to and inactivates PBP2 {mrdA) 
Inactivates PBP3 (ftsi) 

Dipeptide, inhib glucosamine synthase 
Inhib G+ cell wall syn, binds to terminal 
D-ala-D-ala of pentapeptide. 
Prevents dephosphorylation and 
regeneration of lipid carrier 
Disrupts multiple aspects of membrane 
function, including peptidoglycan 
synthesis, lipoteichoic acid synthesis, and 
the bacterial membrane potential 
Surfactant action disrupts cell membrane 
lipids, binds lipid A mioety of LPS 
Analogue of P-enolpyruvate, inhibits 1'* 
step in peptidoglycan synthesis - UDP-N- 
acetylglucosamine enolpyruvyl 
transferase, murA, Also acts as 
Immunosuppressant 
Prevents formation of D-aia dimer, 
inhibits D-ala ligase, ddlA,B 
phosphonodipeptide, cell wall synthesis 
inhibitor, potentiator of P-lactams 



Inhibitors of Protein Processing/Transport 

Globomycin Inhibits signal peptidase II (cleaves 

prolipoproteins subsequent to lipid 
modification, IspA 



ampC, ampD, ampE, 
envZ, galU, hipA, 
hipQ, ompQ ompF, 
ompR, ptsl rfa, 
tolD, tolE 

tonB 

alaS, argS, crp, cyaA, 
envB, mrdA.B, 
mreB,QD 

dppA 



rfa 



pmrA 

murA, crp, cyaA glpZ 
hip A, ptsl uhpT 



hipA, cycA 
pepA, tpp 

Ipp, dnaE 
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EXAMPLE 12 

Transfer of Exogenous Nucleic Acid Sequences to other Bacterial Species Using the E. 
coll Express ion Vectors or Expression Vectors Functional in Bacterial Species other than 

E. coli. 

5 The above methods were validated using antisense nucleic acids which inhibit 

the growth of E. coli which were identified using methods similar to those described 
above. Expression vectors which inhibited growth of E. coli upon induction of 
antisense RNA expression with IPTG were transformed directly into Enterobacter 
cloacae, Klebsiella pneumonia or Salmonella typhimurium. The transformed cells were 

10 then assayed for growth inhibition according to the method of Example 1 . After growth 
in liquid culture, cells were plated at various serial dilutions and a score determined by 
calculating the log difference in growth for INDUCED vs. UNINDUCED antisense 
RNA expression as determined by the maximum 10 fold dilution at which a colony was 
observed. The results of these experiments are listed below in Table VI. If there was 

15 no effect of antisense RNA expression in an organism, the clone is minus in Table VL 

In contrast, a positive in Table VI means that at least 10 fold more cells were required to 
observe a colony on the induced plate than on the non-induced plate under the 
conditions used and in that organism. 

Sixteen of the construts were found to inhibit growth in all the organisms tested 

20 upon induction of antisense RNA expression with IPTG. Those skilled in the art will 
appreciate that a negative result in a heterologous organism does not mean that that 
organism is missing that gene nor does it mean that the gene is unessential. However, a 
positive result means that the heterologous organism contains a homologous gene which 
is required for proliferation of that organism. The homologous gene may be obtained 

25 using the methods described herein. Those cells that are inhibited by antisense may be 

used in cell based assays as described herein for the identification and characterization 
of compounds in order to develop antibiotics effective in these organisms. Those 
skilled in the art will appreciate that an antisense molecule which works in the organism 
from which it was obtained will not always work in a heterologous organism, 

30 
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TABLE VI 

Sensitivity of Other Microorganisms to Antisense Nucleic Acids That Inhibit 

ProHferation in E. coli 



Mol. No. 


5". typhimurium 


E. cloacae 


K. pneumoniae 


EcXAOOl 


+ 


+ 


- 


EcXA004 


- 


- 


- 


EcXAOOS 


4- 




+ 


EcXA006 


- 


- 





EcXAOOV 


- 




- 


EcXAOOS 




- 




EcXAOlO 








EcXAOll 


- 




- 


EcXA012 


- 


+ 


- 


EcXAOlS 




+ 




EcXA014 






- 


EcXAOlS 


- 


-f 


+ 


EcXA016 


+ 


4- 




EcXAOl? 


+ 


-i- 




EcXAOlS 






+ 


EcXAOl 9 




+ 




HCAAUZU 








EcXA021 


+ 




+ 


EcXA023 


+ 






EcXA024 








EcXA025 
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Mol. No. 


S. typhimurium 


E. cloacae 


K. pneumoniae 


EcXA026 


+ 


4- 


EcXA027 


+ 






EcXA028 


+ 






EcXA029 








EcXAOSO 


+ 






EcXAOSl 


+ 






EcXA032 


+ 






EcXA033 


+ 




+ 


EcXA034 


+ 




+ 


EcXA035 


\ 




EcXA036 


+ 




+ 


EcXA037 








EcXA038 








EcXA039 


+ 








EcXA041 


+ 




4- 


EcXA042 






+ 


EcXA044 








EcXA045 








EcXA046 








EcXA047 








EcXA048 








EcXA049 








EcXAOSO 
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MoL No. 


S. typhimurium 


E. cloacae 


K. pneumoniae 


EcXAOSl 


+ 


EcXA052 








EcXA053 


+ 


+ 




EcXA054 






+ 


EcXA055 









EXAMPLE 13 

Use of Id entified Exogenous Nucleic Acid Sequences as Probes 
The identified sequence of the present invention can be used as probes to obtain 
the sequence of additional genes of interest firom a second organism. For example, 
probes to potential bacterial target proteins may be hybridized to nucleic acids fi-om 
other organisms including other bacteria and higher organisms, to identify homologous 
sequences. Such hybridization might indicate that the protein encoded by the gene to 
which the probe corresponds is found in humans and therefore not necessarily a good 
drug target. Alternatively, the gene can be conserved only in bacteria and therefore 
would be a good drug target for a broad spectrum antibiotic or antimicrobial. 

Probes derived from the identified nucleic acid sequences of interest or portions 
thereof can be labeled with detectable labels familiar to those skilled in the art, including 
radioisotopes and non-radioactive labels, to provide a detectable probe. The detectable 
probe can be single stranded or double stranded and can be made using techniques known 
in the art, includmg in vitro transcription, nick translation, or kinase reactions. A nucleic 
acid sample containing a sequence capable of hybridizing to the labeled probe is contacted 
with the labeled probe. If the nucleic acid in the sample is double stranded, it can be 
denatured prior to contacting the probe. In some applications, the nucleic acid sample can 
be immobilized on a surface such as a nitrocellulose or nylon membrane. The nucleic acid 
sample can comprise nucleic acids obtained fix^m a variety of sources, including genomic 
DNA, cDNA libraries, RNA, or tissue samples. 
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Procedures used to detect the presence of nucleic acids capable of hybridizing to 
the detectable probe include well known techniques such as Southern blotting, Northern 
blotting, dot blotting, colony hybridization, and plaque hybridization. In some 
applications, the nucleic acid capable of hybridizing to the labeled probe can be cloned 
into vectors such as expression vectors, sequencing vectors, or in vitro transcription 
vectors to facilitate the characterization and expression of the hybridizing nucleic acids in 
the sample. For example, such techniques can be used to isolate, purify and clone 
sequences from a genomic library, made from a variety of bacterial species, which are 
capable of hybridizing to probes made from the sequences identified in Examples 5 and 6. 

EXAMPLE 14 
Preparation of PCR Primers and Amplification of ON A 

The identified E. coli genes corresponding directly to or located within the operon 
of nucleic acid sequences required for proliferation or portions thereof can be used to 
prepare PCR primers for a variety of applications, including the identification or isolation 
of homologous sequences from other species, for example S. typhimunum, E. cloacae, and 
Klebsiella pneumoniae, which contain part or all of the homologous genes. Because 
homologous genes are related but not identical in sequence, those skilled in the art will 
often employ degenerate sequence PCR primers. Such degenerate sequence primers are 
designed based on conserved sequence regions, either known or suspected, such as 
conserved coding regions. The successfiil production of a PCR product using degenerate 
probes generated from the sequences identified herein would mdicate the presence of a 
homologous gene sequence in the species being screened. The PCR primers are at least 10 
bases, and preferably at least 20 bases in length. More preferably, the PCR primers are at 
least 20-30 bases in length. In some embodiments, the PCR primers can be more than 30 
bases in length. It is preferred that the primer pairs have approximately the same G/C 
ratio, so that melting temperatures are approximately the same. A variety of PCR 
techniques are familiar to those skilled in the art. For a review of PCR technology, see 
Molecular Cloning to Genetic Engineering White, B.A. Ed. in Methods in Molecular 
Biology 67: Humana Press, Totowa 1997. When the entire coding sequence of the target 
gene is known, the 5' and 3' regions of the target gene can be used as the sequence source 
for PCR probe generation. In each of these PCR procedures, PCR primers on either side 
of the nucleic acid sequences to be amplified are added to a suitably prepared nucleic acid 
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sample along with dNTPs and a thermostable polymerase such as Taq polymerase, Pfli 
polymerase, or Vent polymerase. The nucleic acid in the sample is denatured and the PGR 
primers are specifically hybridized to complementary nucleic acid sequences in the 
sample. The hybridized primers are extended. Thereafter, another cycle of denaturation, 
hybridization, and extension is initiated. The cycles are repeated multiple times to produce 
an amplified firagment containing the nucleic acid sequence between the primer sites. 

EXAMPLE 15 
Inverse PGR 

The technique of inverse polymerase chain reaction can be used to extend the 
known nucleic acid sequence identified in Examples 5 and 6. The inverse PGR reaction is 
described generally by Ochman et al., in Gh. 10 of PGR Technology: Principles and 
Applications for DNA Amplification, (Henry A. Eriich, Ed.) W.H. Freeman and Go. 
(1992). Traditional PGR requires two primers that are used to prime the synthesis of 
complementary strands of DNA. In inverse PGR, only a core sequence need be known. 

Using the sequences identified as relevant from the techniques taught in Examples 
5 and 6 and applied to other species of bacteria, a subset of exogenous nucleic sequences 
are identified that correspond to genes or operons that are required for bacterial 
proliferation. In species for which a genome sequence is not known, the technique of 
inverse PGR provides a method for obtaining the gene in order to determine the sequence 
or to place the probe sequences in fiall context to the target sequence to which the 
identified exogenous nucleic acid sequence binds. 

To practice this technique, the genome of the target organism is digested with an 
appropriate restriction enzyme so as to create firagments of nucleic acid tiiat contain the 
identified sequence as well as unknown sequences that flank the identified sequence. 
These fi-agments are then circularized and become tiie template for tiie PGR reaction. PGR 
primers are designed in accordance with the teachings of Example 15 and directed to the 
ends of the identified sequence are synthesized. The primers direct nucleic acid synthesis 
away fi-om the known sequence and toward the unknown sequence contained within the 
circularized template. After the PGR reaction is complete, the resulting PGR products can 
be sequenced so as to extend the sequence of the identified gene past the core sequence of 
tiie identified exogenous nucleic acid sequence identified. In this manner, the fiill 



-98- 



sequence of each novel gene can be identified. Additionally the sequences of adjacent 
coding and noncoding regions can be identified. 

EXAMPLE 16 

Identification of Genes Required fox Stavhvlococcus aureus Proliferation 
5 Genes required for proliferation in Staphylococcus aureus are identified according 

to the methods described above. 

EXAMPLE 17 

Identification of Genes Required for Neisseria gonorrhoeae Proliferation 
Genes required for proliferation in Neisseria gonorrhoeae are identified according 
10 to the methods described above. 

EXAMPLE 18 

Identification of Genes Required for Pseudomonas aeruginosa Proliferation 
Genes required for proliferation in Pseudomonas aeruginosa are identified 
according to the methods described above. 
15 EXAMPLE 19 

Identification of Genes Required for Enterococcus faecalis Proliferation 
Genes required for proliferation in Enterococcus faecalis are identified according 
to the methods described above. 

EXAMPLE 20 

20 Identification of Genes Required for Haemophilus influenzae Proliferation 

Genes required for proliferation in Haemophilus influenzae are identified 
according to the methods described above. 

EXAMPLE 21 

Identification of Genes Required for Salmonella tvvhimurium Proliferation 
25 Genes required for proUferation in Salmonella typhimurium are identified 

according to the methods described above. 

EXAMPLE 22 

Identification of Genes Required for Helicobacter pylori Proliferation 
Genes required for proliferation in Helicobacter pylori are identified according to 
30 the methods described above. 

EXAMPLE 23 

Identification of Genes Required for Mycoplasma pneumoniae Proliferation 
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Genes required for proliferation in Mycoplasma pneumoniae are identified 
according to the methods described above. 

EXAMPLE 24 

Identification of Genes Required for Plasmodium ovale Proliferation 
5 Genes required for proliferation in Plasmodium ovale are identified according to 

the methods described above, 

EXAMPLE 25 

Identification of Genes Required for Saccharomvces cerevisiae Proliferation 
Genes required for proliferation in Saccharomyces cerevisiae are identified 
10 according to the methods described above. 

EXAMPLE 26 

Identification of Genes Required for Entamoeba histolytica Proliferation 
Genes required for proliferation in Entamoeba histolytica are identified according 
to the methods described above. 
15 EXAMPLE 27 

Identification of Genes Required for Candida albicans Proliferation 
Genes required for proliferation in Candida albicans are identified according to 
the methods described above. 

EXAMPLE 28 

20 Identification of Genes Required for Klebsiella vneumoniae Proliferation 

Genes required for proliferation in Klebsiella pneumoniae are identified according 
to the methods described above. 

EXAMPLE 29 

Identification of Genes Required for Salmonella typhi Proliferation 
25 Genes required for proliferation in Salmonella typhi are identified according to the 

methods described above. 

EXAMPLE 30 

Identification of Genes Required for Salmonella paratyphi Proliferation 
Genes required for proliferation in Salmonella paratyphi are identified according 
30 to the methods described above. 

EXAMPLE 31 

Identification of Genes Required for Salmonella cholerasuis Proliferation 
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Genes required for proliferation in Salmonella cholerasuis are identified according 
to the methods described above. 

EXAMPLE 32 

Identification of Genes Required for Staphylococcus epidermis Proliferation 
Genes required for proliferation in Staphylococcus epidermis are identified 
according to the methods described above. 

EXAMPLE 33 

Identification of Genes Required for Mycobacterium tuberculosis Proliferation 
Genes required for proliferation in Mycobacterium tuberculosis are identified 
according to the methods described above. 

EXAMPLE 34 

Identification of Genes Required for Mycobacterium leprae Proliferation 
Genes required for proliferation in Mycobacterium leprae are identified according 
to the methods described above. 

EXAMPLE 35 

Identification of Genes R equired for Treponema pallidum Proliferation 
Genes required for proliferation in Treponema pallidum are identified according to 
the methods described above. 

EXAMPLE 36 

Identification of Genes Required for Bacillus anthracis Proliferation 
Genes required for proliferation in Bacillus anthracis are identified according to 
the methods described above. 

EXAMPLE 37 

Identifica tion of Genes Requu-ed for Yersinia pestis Proliferation 
Genes required for proliferation in Yersinia pestis are identified according to tiie 
methods described above. 

EXAMPLE 38 

Identification o f Genes Required for Clostridium botulinum Proliferation 
Genes required for proliferation in Clostridium botulinum are identified according 
to the methods described above. 

EXAMPLE 39 

Identification of Genes Re quired for Campylobacter jejuni Proliferation 
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Genes required for proliferation in Campylobacter jejuni are identified according 
to the methods described above. 

EXAMPLE 40 

Identification o f Genes Required for Chlamydia trachomatis Proliferation 
Genes required for proliferation in Chlamydia trachomatis are identified 
according to the methods described above. 



Use of Isolated Exogenous Nucleic Acid Fragments as Antisense Antibiotics 

In addition to using the identified sequences to enable screening of molecule 

1 0 libraries to identify compounds usefiil to identify antibiotics, the sequences themselves can 
be used as therapeutic agents. Specifically, the identified exogenous sequences in an 
antisense orientation can be provided to an individual to inhibit the translation of a 
bacterial target gene. 

Generation of Antisense Ther apeutics fi-om Identified Exogenous Sequences 
The sequences of the present invention can be used as antisense therapeutics for 
the treatment of bacterial infections or simply for inhibition of bacterial growth in vitro 
or in vivo. The therapy exploits the biological process in cells where genes are 
transcribed into messenger RNA (mRNA) that is then translated into proteins. 
Antisense RNA technology contemplates the use of antisense oligonucleotides directed 

20 against a target gene that will bind to its target and decrease or inhibit the translation of 
the target mRNA. In one embodiment, antisense oligonucleotides can be used to treat 
and control a bacterial infection of a cell culture containing a population of desired cells 
contaminated with bacteria. In another embodiment, the antisense oligonucleotides can 
be used to treat an organism with a bacterial infection. 

25 Antisense oligonucleotides can be synthesized from any of the sequences of the 

present invention using methods well known in tiie art. In a preferred embodiment, 
antisense oligonucleotides are synthesized using artificial means. Uhlmann & 
Peymann, Chemical Rev. 90:543-584 (1990) review antisense oligonucleotide 
technology in detail. Modified or unmodified antisense oligonucleotides can be used as 

30 therapeutic agents. Modified antisense oligonucleotides are preferred since it is well 
known that antisense oligonucleotides are extremely unstable. Modification of the 
phosphate backbones of the antisense oligonucleotides can be achieved by substituting 
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the intemucleotide phosphate residues with methylphosphonates, phosphorothioates, 
phosphoramidates, and phosphate esters. Nonphosphate intemucleotide analogs such as 
siloxane bridges, carbonate brides, thioester bridges, as well as many others known in 
the art. The preparation of certain antisense oligonucleotides with modified 
intemucleotide linkages is described in U.S. Patent No. 5,142,047, hereby incorporated 
by reference. 

Modifications to the nucleoside units of the antisense oligonucleotides are also 
contemplated. These modifications can increase the half-life and increase cellular rates 
of uptake for the oligonucleotides in vivo. For example, a-anomeric nucleotide units 
and modified bases such as 1,2-dideoxy-d-ribofuranose, 1,2-dideoxy-l- 
phenylribofiiranose, and N^, A/^-ethano-5-methyl-cytosine are contemplated for use in 
the present invention. 

An additional form of modified antisense molecules is found in peptide nucleic 
acids. Peptide nucleic acids (PNA) have been developed to hybridize to single and double 
stranded nucleic acids. PNA are nucleic acid analogs in which the entire deoxyribose- 
phosphate backbone has been exchanged with a chemically completely different, but 
structurally homologous, polyamide (peptide) backbone containing 2-aminoethyl glycine 
units. Unlike DNA, which is highly negatively charged, the PNA backbone is neutral. 
Therefore, there is much less repulsive energy between complementary strands m a PNA- 
DNA hybrid than in the comparable DNA-DNA hybrid, and consequently they are much 
more stable. PNA can hybridize to DNA in either a Watson/Crick or Hoogsteen fashion 
(Demidov et al., Proc. Natl. Acad. Sci. U.S.A. 92:2637-2641, 1995; Egholm, Nature 
365:566-568, 1993; Nielsen et al, Science 254:1497-1500, 1991; Dueholm et al.. New J. 
Chem. 21:19-31, 1997). 

Molecules called PNA "clamps" have been synthesized which have two identical 
PNA sequences joined by a flexible hairpin linker containing three 8-amino-3,6- 
dioxaoctanoic acid units. When a PNA clamp is mixed with a complementary homopurine 
or homopyrimidine DNA target sequence, a PNA-DNA-PNA triplex hybrid can form 
which has been shown to be extremely stable (Bentin et al., Biochemistry 35:8863-8869, 
1996; Eghohn et al., Nucleic Acids Res. 23:217-222, 1995; Griffith et al., J. Am. Chem. 
Soc. 117:831-832, 1995). 
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The sequence-specific and high affinity duplex and triplex binding of PNA have 
been extensively described (Nielsen et al., Science 254:1497-1500, 1991; Eghohn et al, J. 
Am. Chem. Soa 114:9677-9678, 1992; Eghohn et al., Nature 365:566-568, 1993; 
Almarsson et al.. Proa Natl Acad ScL U.S.A, 90:9542-9546, 1993; Demidov et al.. Proa 
5 Natl Acad Sci UXA, 92:2637-2641, 1995). They have also been shown to be resistant to 
nuclease and protease digestion (Demidov et al, Biochem, Pharm. 48:1010-1313, 1994). 
PNA has been used to inhibit gene expression (Hanvey et al., Science 258:1481- 
1485,1992; Nielsen et al, Nucl Acids, Res,, 21:197-200, 1993; Nielsen et al, Gene 
149:139-145, 1994; Good & Nielsen, Science, 95: 2073-2076, 1998; all of which are 

10 hereby incorporated by reference), to block restriction enzyme activity (Nielsen et al., 
supra, 1993), to act as an artificial transcription promoter (MoUegaard, Proc, Natl Acad, 
Sci US. A. 91:3892-3895, 1994) and as a pseudo restriction endonuclease (Demidov et al., 
Nucl Acids. Res, 21:2103-2107, 1993). Recently, PNA has also been shown to have 
antiviral and antitumoral activity mediated through an antisense mechanism (Norton, 

15 Nature Biotechnol, 14:615-619, 1996; Hirschman et al., J. Investig, Med, 44:347-351, 

1996), PNAs have been linked to various peptides in order to promote PNA entry into 
cells (Basu et al., BioconJ, Chem, 8:481-488, 1997; Pardridge et al., Proc. Natl Acad ScL 
U,SA, 92:5592-5596, 1995). 

The antisense oligonucleotides contemplated by the present invention can be 

20 administered by direct application of oligonucleotides to a target using standard 

techniques well known in the art. The antisense oligonucleotides can be generated 
within the target using a plasmid, or a phage. Alternatively, the antisense nucleic acid 
may be expressed from a sequence in the chromosome of the target cell It is further 
contemplated that contemplated that the antisense oligonucleotide contemplated are 

25 incorporated in a ribozyme sequence to enable the antisense to specifically bind and cleave 

its target mRNA. For technical applications of ribozyme and antisense oligonucleotides 
see Rossi et aL, PharmacoL Then 50(2) :245 -254, (1991), which is hereby incorporated 
by reference. The present invention also contemplates using a retron to introduce an 
antisense oligonucleotide to a cell. Retron technology is exemplified by U.S. Patent 

30 No. 5,405,775, which is hereby incorporated by reference. Antisense oligonucleotides 
can also be delivered using liposomes or by electroporation techniques which are well 
known in the art. 
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The antisense nucleic acids of the present invention can also be used to design 
antibiotic compounds comprising nucleic acids which function by intracellular triple helix 
formation. Triple helix oligonucleotides are used to inhibit transcription from a genome. 
The sequences identified as required for proliferation in the present invention, or portions 
5 thereof, can be used as templates to inhibit microorganism gene expression in individuals 
infected with such organisms. Traditionally, homopurine sequences were considered the 
most useful for triple helix strategies. However, homopyrimidine sequences can also 
inhibit gene expression. Such homopyrimidine oligonucleotides bind to the major groove 
at homopurine:homopyrimidine sequences. Thus, both types of sequences based on the 

10 sequences of the present invention that are required for proliferation are contemplated for 

use as antibiotic compound templates. 

The antisense oligonucleotides of this example employ the identified sequences 
of the present invention to induce bacterial cell death or at least bacterial stasis by 
inhibiting target gene translation. Antisense oligonucleotides containing from about 8 

15 to 40 bases of the sequences of the present invention have sufficient complementary to 
form a duplex with the target sequence under physiological conditions. 

To kill bacterial cells or inhibit their grovvth, the antisense oligonucleotides are 
applied to the bacteria or to the target cells under conditions that facilitate their uptake. 
These conditions include sufficient incubation times of cells and oligonucleotides so 

20 that the antisense oligonucleotides are taken up by the cells. In one embodiment, an 
incubation period of 7-10 days is sufficient to kill bacteria in a sample. An optimum 
concentration of antisense oligonucleotides is selected for use. 

The concentration of antisense oligonucleotides to be used can vary depending 
on the type of bacteria sought to be controlled, the nature of the antisense 

25 oligonucleotide to be used, and the relative toxicity of the antisense oligonucleotide to 

the desired cells in the treated culture. Antisense oligonucleotides can be introduced to 
cell samples at a number of different concentrations preferably between IxlO'^^M to 1x10' 
"^M. Once the minimum concentration that can adequately control gene expression is 
identified, the optimized dose is translated into a dosage suitable for use in vivo. For 

30 example, an inhibiting concentration in culture of 1x10'^ translates into a dose of 

approximately 0.6 mg/kg body weight. Levels of oligonucleotide approaching 100 mg/kg 
body weight or higher may be possible after testing the toxicity of the oligonucleotide in 
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laboratory animals. It is additionally contemplated that cells from the subject are removed, 
treated with the antisense oligonucleotide, and reintroduced into the subject. This range is 
merely illustrative and one of skill in the art are able to determine the optimal 
concentration to be used in a given case. 
5 After the bacterial cells have been killed or controlled in a desired culture, the 

desired cell population may be used for other purposes. 

EXAMPLE 41 

The following example demonstrates the ability of an E. coli antisense 
oligonucleotide to act as a bactericidal or bacteriostatic agent to treat a contaminated 
10 cell culture system. The application of the antisense oligonucleotides of the present 
invention are thought to inhibit the translation of bacterial gene products required for 
proliferation. 

The antisense oligonucleotide of this example corresponds to a 30 base 
phophorothioate modified oligodeoxynucelotide complementary to a nucleic acid 

15 involved in proliferation, such as Molecule Number EcXAOOl. A sense 
oligodeoxynucelotide complementary to the antisense sequence is synthesized and used 
as a control. The oligonucleotides are synthesized and purified according to the 
procedures of Matsukura, et al., Gene 72:343 (1988). The test oligonucleotides are 
dissolved in a small volume of autoclaved water and added to culture medium to make a 

20 100 micromolar stock solution. 

Human bone marrow cells are obtained from the peripheral blood of two 
patients and cultured according standard procedures well known in the art. The culture 
is contaminated with the K-12 strain of E. coli and incubated at 37°C overnight to 
establish bacterial infection. 

25 The control and antisense oligonucleotide containing solutions are added to the 

contaminated cultures and monitored for bacterial growth. After a 10 hour incubation 
of culture and oligonucleotides, samples from the control and experimental cultures are 
drawn and analyzed for the translation of the target bacterial gene using standard 
microbiological techniques well known in the art. The target E. coli gene is found to be 

30 translated in the control culture treated with the control oligonucleotide, however, 
translation of the target gene in the experimental culture treated with the antisense 
oligonucleotide of the present invention is not detected or reduced. 
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EXAMPLE 42 

A subject suffering from an E. coli infection is treated with the antisense 
oligonucleotide preparation of Example 39. The antisense oligonucleotide is provided 
in a pharmaceutically acceptable carrier at a concentration effective to inhibit the 
translation of the target gene. The present subject is treated with a concentration of 
antisense oligonucleotide sufficient to achieve a blood concentration of about 100 
micromolar. The patient receives daily injections of antisense oligonucleotide to 
maintain this concentration for a period of 1 week. At the end of the week a blood 
sample is drawn and analyzed for the presence or absence using standard techniques 
well known in the art. There is no detectable evidence of E. coli and the treatment is 
terminated. 

EXAMPLE 43 

Preparation and use of Triple Helix Probes 

The sequences of microorganism genes required for proliferation of the present 
invention are scanned to identify 10-mer to 20-mer homopyrimidme or homopurine 
stretches that could be used in triple-helix based strategies for inhibiting gene expression. 
Following identification of candidate homopyrimidine or homopurine stretches, their 
efficiency in mhibiting gene expression is assessed by introducing varying amounts of 
oligonucleotides containing the candidate sequences into a population of bacterial cells 
that normally express the target gene. The oligonucleotides may be prepared on an 
oligonucleotide synthesizer or they may be purchased commercially from a company 
specializing in custom oligonucleotide synthesis, such as GENSET, Paris, France. 

The oligonucleotides can be introduced into the cells using a variety of methods 
known to those skilled in the art, including but not limited to calcium phosphate 
precipitation, DEAE-Dextran, electroporation, liposome-mediated transfection or native 
uptake. 

Treated cells are monitored for a reduction in proliferation using techniques such 
as monitoring growth levels as compared to untreated cells using optical density 
measurements. The oligonucleotides that are effective in inhibiting gene expression in 
cultured cells can then be introduced in vivo using the techniques well known in that art at 
a dosage level shown to be effective. 
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In some embodiments, the natural (beta) anomers of the oligonucleotide units can 
be replaced with alpha anomers to render the oligonucleotide more resistant to nucleases. 
Further, an intercalating agent such as ethidium bromide, or the like, can be attached to the 
y end of the alpha oligonucleotide to stabilize the triple helix. For information on the 
5 generation of oligonucleotides suitable for triple helix formation see Griffin et al (Science 

245:967-971 (1989), which is hereby incorporated by this reference). 

EXAMPLE 44 

Identification of Bacterial Strains from Isolated Specimens by PGR 
Glassical bacteriological methods for the detection of various bacterial species 
10 are time consuming and costly. These methods include growing the bacteria isolated 
from a subject in specialized media, cultivation on selective agar media, followed by a 
set of confirmation assays that can take from 8 to 10 days or longer to complete. Use of 
the identified sequences of the present invention provides a method to dramatically 
reduce the time necessary to detect and identify specific bacterial species present in a 
15 sample. 

In one exemplary method, bacteria are grown in enriched media and DNA samples 
are isolated firom specimens of, for example, blood, urine, stool, saliva or central nervous 
system fluid by conventional methods. A panel of PGR primers based on identified 
sequences unique to various species of microorganisms are then utilized in accordance 

20 with Example 12 to amplify DNA of approximately 100-200 bases in length firom the 

specimen. A separate PGR reaction is set up for each pair of PGR primers and after the 
PGR reaction is complete, the reaction mixtures are assayed for the presence of PGR 
product. The presence or absence of bacteria from the species to which the PGR primer 
pairs belong is determined by the presence or absence of a PGR product in the various test 

25 PGR reaction tubes. 

Although the PGR reaction is used to assay the isolated sample for the presence of 
various bacterial species, other assays such as the Southern blot hybridization are also 
contemplated. 

All documents cited herein are incorporated herein by reference in their entireties, 

30 
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WHAT IS CLAIMED IS : 

1 . A purified or isolated nucleic acid sequence consisting essentially of one 
of SEQ ID NOs: 405-485, wherein said nucleic acid inhibits microorganism 
proliferation. 

5 2. The nucleic acid sequence of Claim 1, wherein said nucleic acid 

sequence is complementary to at least a portion of a coding sequence of a gene whose 
expression is required for microorganism proliferation. 

3. The nucleic acid sequence of Claims 1 or 2, wherein said nucleic acid 
comprises a fragment of one of SEQ ID NOs. 405-485, said fragment selected from the 

10 group consisting of fragments comprising at least 10, at least 20, at least 25, at least 30, 

at least 50 or more than 50 consecutive bases of one of SEQ ID NOs: 405-485. 

4. The nucleic acid sequence of Claim 3, wherein said nucleic acid 
sequence is complementary to a coding sequence of a gene whose expression is 
required for microorganism proliferation. 

15 5. A vector comprising a promoter operably linked to a nucleic acid 

comprising a sequence selected from the group consisting of SEQ ID NOs. 405-485. 

6. The vector of Claim 5, wherein said promoter is active in an organism 
selected from the group consisting of Escherichia coli, Staphylococcus aureus, 
Pseudomonas aeruginosa, Enterobacter cloacae, Helicobacter pylori, Neisseria 

20 gonorrhoeae, Enterococcus faecalis, Streptococcus pneumoniae, Haemophilus 
influenzae, Salmonella typhimurium, Saccharomyces cerevisiae, Candida albicans, 
Cryptococcus neofi)rmans, Aspergillus fiimigatus, Klebsiella pneumoniae, Salmonella 
typhi, Salmonella paratyphi, Salmonella cholerasuis, Staphylococcus epidermidis, 
Mycobacterium tuberculosis, Mycobacterium leprae, Treponema pallidum, Bacillus 

25 anthracis. Yersinia pestis, Clostridium botulinum, Campylobacter jejuni, Chlamydia 

trachomatus, Chlamydia pneumoniae or any species falling within the genera of any of 
the above species. 

7. A host cell containing the vector of Claim 5 or Claim 6. 

8. A purified or isolated nucleic acid consisting essentially of the coding 
30 sequence of one of SEQ ID NOs: 82-88, 90-242. 
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9. A fragment of the nucleic acid of Claim 8, said fragment comprising at 
least 10, at least 20, at least 25, at least 30, at least 50 or more than 50 consecutive bases 
of one of SEQ ID NOs: 82-88, 90-242. 

10. A vector comprising a promoter operably linked to the nucleic acid of 
5 Claim 8 or Claim 9. 

IL A purified or isolated nucleic acid comprising a nucleic acid sequence 
complementary to at least a portion of an intragenic sequence, intergenic sequence, 
sequences spanning at least a portion of tv^o or more genes, 5' noncoding region, or 3' 
noncoding region within an operon encoding a polypeptide comprising a sequence 

10 selected from the group consisting of SEQ ID NOs: 243-357, 359-398. 

12. A purified or isolated nucleic acid comprising a nucleic acid having at 
least 70% homology to a sequence selected from the group consisting of SEQ ID NOs 
405-485, 82-88, 90-242 or the sequences complementary thereto as determined using 
BLASTN version 2.0 with the default parameters. 

15 13. The nucleic acid of Claim 12, wherein said nucleic acid is from an 

organism selected from the group consisting of Staphylococcus aureus, Pseudomonas 
aeruginosa, Enterobacter cloacae, Helicobacter pylori, Neisseria gonorrhoeae, 
Enterococcus faecalis, Streptococcus pneumoniae, Haemophilus influenzae, Salmonella 
typhimurium, Saccharomyces cerevisiae, Candida albicans, Cryptococcus neoformans, 

20 Aspergillus fumigatus, Klebsiella pneumoniae, Salmonella typhi, Salmonella paratyphi. 
Salmonella cholerasuis, Staphylococcus epidermidis, Mycobacterium tuberculosis, 
Mycobacterium leprae, Treponema pallidum, Bacillus anthracis, Yersinia pestis, 
Clostridium botulinum, Campylobacter jejuni, and Chlamydia trachomatus. Chlamydia 
pneumoniae or any species falling within the genera of any of the above species. 

25 14. A purified or isolated nucleic acid consisting essentially of a nucleic acid 

encoding a polypeptide having a sequence selected from the group consisting of SEQ 
ID NOs.: 243-357, 359-398. 

15. A vector comprising a promoter operably linked to a nucleic acid 
encoding a polypeptide having a sequence selected from the group consisting of SEQ 

30 ID NOs.: 243-357, 359-398. 

1 6. A host cell containing the vector of Claim 15. 
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17. A purified or isolated polypeptide comprising the sequence of one of 
SEQ ID NOs: 243-357, 359-398. 

18. A purified or isolated polypeptide comprising a fragment of one of the 
polypeptides of SEQ ID NOs. 243-357, 359-398, said fragment selected from the group 

5 consisting of fragments comprising at least 5, at least 10, at least 20, at least 30, at least 
40, at least 50, at least 60 or more than 60 consecutive amino acids of one of the 
polypeptides of SEQ ID NOs.: 243-357, 359-398. 

19. An antibody capable of specifically binding the polypeptide of Claim 17 
or Claim 18. 

10 20. A method of producing a polypeptide, comprising introducing a vector 

comprising a promoter operably linked to a nucleic acid encoding a polypeptide having 
a sequence selected from the group consisting of SEQ ID NOs. 243-357, 359-398into a 
cell. 

21. The method of Claim 20, further comprising the step of isolating said 

15 protein. 

22. A method of inhibiting proliferation comprising inhibiting the activity or 
reducing the amount of a polypeptide having a sequence selected from the group 
consisting of SEQ ID NOs. 243-357, 359-398or inhibiting the activity or reducing the 
amount of a nucleic acid encoding said polypeptide. 

20 23. A method for identifying compounds which influence the activity of a 

polypeptide required for proliferation comprising: 

contacting a polypeptide having a sequence selected from the group 
consisting of 243-357, 359-398with a candidate compound; and 

determining whether said compound influences the activity of said 
25 polypeptide, 

24. The method of Claim 23, wherein said activity is an enzymatic activity. 

25. The method of Claim 23, wherein said activity is a carbon compound 
catabolism activity. 

26. The method of Claim 23, wherein said activity is a biosynthetic activity. 
30 27. The method of Claim 23, wherein said activity is a transporter activity. 

28. The method of Claim 23, wherein said activity is a transcriptional 
activity. 
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29. The method of Claim 23, wherein said activity is a DNA replication 
activity. 

30. The method of Claim 23, wherein said activity is a cell division activity. 

31. A method for assaying compounds for the ability to reduce the activity or 
5 level of a polypeptide required for proliferation, comprising: 

providing a target, wherein said target comprises the coding sequence of 
a sequence selected from the group consisting of SEQ ID NOs. 82-88, 90-242; 
contacting said target with a candidate compound; and 
measuring an activity of said target. 
10 32. The method of Claim 31, wherein said target is a messenger RNA 

molecule transcribed from a coding region of one of SEQ ID. NOs.: 82-88, 90-242 and 
said activity is translation of said messenger RNA. 

33. The method of Claim 32, wherein said target is a coding region of one of 
SEQ ID. NOs, 82-88, 90-242 and said activity is transcription of said messenger RNA. 
15 34. A compound identified using the method of Claim 3 1 . 

35. A method for identifying compounds which reduce the activity or level 
of a gene product required for cell proliferation comprising the steps of: 

expressing an antisense nucleic acid against a nucleic acid encoding said 
gene product in a cell to reduce the activity or amount of said gene product in 
20 said cell, thereby producing a sensitized cell; 

contacting said sensitized cell with a compound; and 
determining whether said compound inhibits the growth of said 
sensitized cell to a greater extent than said compound inhibits the growth of a 
nonsensitized cell. 

25 36. The method of Claim 35, wherein said cell is selected from the group 

consisting of bacterial cells, fungal cells, plant cells, and animal cells. 

37. The method of Claim 36, wherein said cell is an E, coli cell. 

38. The method of Claim 36, wherein said cell is from an organism selected 
from the group consisting of Staphylococcus aureus, Pseudomonas aeruginosa, 

30 Enterobacter cloacae, Helicobacter pylori, Neisseria gonorrhoeae, Enterococcus 

faecalis, Streptococcus pneumoniae, Haemophilus influenzae, Salmonella typhimurium, 
Saccharomyces cerevisiae, Candida albicans, Cryptococcus neoformans, Aspergillus 
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fumigatus, Klebsiella pneumoniae, Salmonella typhi, Salmonella paratyphi, Salmonella 
cholerasuiSy Staphylococcus epidermidis, Mycobacterium tuberculosis, Mycobacterium 
leprae, Treponema pallidum. Bacillus anthracis, Yersinia pestis, Clostridium 
botulinum, Campylobacter jejuni, and Chlamydia trachomatus, Chlamydia pneumoniae 
5 or any species falling within the genera of any of the above species. 

39. The method of Claim 35, wherein said antisense nucleic acid is 
transcribed from an inducible promoter. 

40. The method of Claim 39, further comprising the step of contacting said 
cell with a concentration of inducer which induces said antisense nucleic acid to a 

10 sublethal level. 

41. The method of Claim 40, wherein said sub-lethal concentration of said 
inducer is such that growth inhibition is 8% or more. 

42. The method of Claim 40, wherein said inducer is isopropyl-l-thio-P-D- 
galactoside. 

15 43. The method of Claim 35, wherein growth inhibition is measured by 

monitoring optical density of a culture growth solution. 

44. The method of Claim 35, wherein said gene product is a polypeptide. 

45. The method of Claim 35, wherein said gene product is an RNA. 

46. The method of Claim 44, wherein said gene product comprises a 
20 polypeptide having a sequence selected from the group consisting of SEQ ID NOs.: 

243-357, 359-398. 

47. A compound identified using the method of Claim 35. 

48. A method for inhibiting cellular proliferation comprising introducing a 
compound with activity against a gene corresponding to one of SEQ ID NOs.: 82-88, 

25 90-242 or with activity against the product of said gene into a population of cells 

expressing a gene. 

49. The method of Claim 48, wherein said compound is an antisense 
oligonucleotide comprising a sequence selected from the group consisting of SEQ ID 
NOs.: 405-485, or a proliferation-inhibiting portion thereof. 

30 50. The method of Claim 49, wherein said proliferation inhibiting portion of 

one of SEQ ID NOs. 405-485 is a fragment comprising at least 10, at least 20, at least 



-113- 



25, at least 30, at least 50 or more than 50 consecutive bases of one of SEQ ID NOs: 
405-485. 

51. The method of Claim 48, wherein said compound is a triple helix 
oligonucleotide. 

5 52. A preparation comprising an effective concentration of an antisense 

oligonucleotide comprising a sequence selected from the group consisting of SEQ ID 
NOs.: 405-485, or a proliferation-inhibiting portion thereof in a pharmaceutically 
acceptable carrier. 

53. The preparation of Claim 52, wherein said proliferation-inhibiting 
10 portion of one of SEQ ID NOs. 405-485 comprises at least 10, at least 20, at least 25, at 

least 30, at least 50 or more than 50 consecutive bases of one of SEQ ID NOs: 405-485. 

54. A method for inhibiting the expression of a gene in an operon required 
for proliferation comprising contacting a cell in a cell population with an antisense 
nucleic acid, said cell expressing a gene corresponding to one of SEQ ID NOs.: 82-88, 

15 90-242, wherein said antisense nucleic acid comprises at least a proliferation-inhibiting 

portion of said operon in an antisense orientation that is effective in inhibiting 
expression of said gene. 

55. The method of Claim 54, wherein said antisense nucleic acid is 
complementary to a sequence of a gene comprising one or more of SEQ ID NOs.: 82- 

20 88, 90-242. 

56. The method of Claim 54, wherein said antisense nucleic acid is a 
sequence of one of SEQ ID NOs.: 405-485, or a portion thereof. 

57. The method of Claim 54, wherein said cell is contacted with said 
antisense nucleic acid by introducing a plasmid which expresses said antisense nucleic 

25 acid into said cell population. 

58. The method of Claim 54, wherein said cell is contacted with said 
antisense nucleic acid by introducing a phage which expresses said antisense nucleic 
acid into said cell population. 

59. The method of Claim 54, wherein said cell is contacted with said 
30 antisense nucleic acid by introducing a sequence encoding said antisense nucleic acid 

into the chromosome of said cell into said cell population. 
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60. The method of Claim 54, wherein said cell is contacted with said 
antisense nucleic acid by introducing a retron which expresses said antisense nucleic 
acid into said cell population. 

61. The method of Claim 54, wherein said cell is contacted with said 
5 antisense nucleic acid by introducing a ribozyme into said cell-population, wherein a 

binding portion of said ribozyme is complementary to said antisense oligonucleotide. 

62. The method of Claim 54, wherein said cell is contacted with said 
antisense nucleic acid by introducing a liposome comprising said antisense 
oligonucleotide into said cell. 

10 63. The method of Claim 54, wherein said cell is contacted with said 

antisense nucleic acid by electroporation. 

64. The method of Claim 54, wherein said antisense nucleic acid is a 

fragment comprising at least 10, at least 20, at least 25, at least 30, at least 50 or more 

than 50 consecutive bases of one of SEQ ID NOs: 82-88, 90-242. 
15 65. The method of Claim 54 wherein said antisense nucleic acid is an 

oligonucleotide. 

66. A method for identifying bacterial strains comprising the steps of: 
providing a sample containing a bacterial species; and 
identifying a bacterial species using a species specific probe having a 
20 sequence selected from the group consisting of SEQ ID NOs. 405-485, 82-88, 

90-242. 

67. A method for identifying a gene in a microorganism required for 
proliferation comprising: 

(a) identifying an inhibitory nucleic acid which inhibits the activity 
25 of a gene or gene product required for proliferation in a first microorganism; 

(b) contacting a second microorganism with said inhibitory nucleic 

acid; 

(c) determining whether said inhibitory nucleic acid from said first 
microorganism inhibits proliferation of said second microorganism; and 

30 (d) identifying the gene in said second microorganism which is 

inhibited by said inhibitory nucleic acid. 
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68. A method for assaying a compound for the ability to inhibit proliferation 
of a microorganism comprising: 

(a) identifying a gene or gene product required for proliferation in a 
first microorganism; 

5 (b) identifying a homolog of said gene or gene product in a second 

microorganism; 

(c) identifying an inhibitory nucleic acid sequence which inhibits the 
activity of said homolog in said second microorgansim; 

(d) contacting said second microorganism with a proliferation- 
10 inhibiting amount of said inhibitory nucleic acid, thus sensitizing said second 

microorganism; 

(e) contacting the sensitized microorganism of step (d) with a 
compound; and 

(f) determining whether said compound inhibits proliferation of said 
15 sensitized microorganism to a greater extent than said compound inhibits 

proliferation of a nonsensitized microorganism, 

69. The method of Claim 68, wherein said step of identifying a gene 
involved in proliferation in a first microorganism comprises: 

introducing a nucleic acid comprising a random genomic fragment from 
20 said first microorganism operably linked to a promoter wherein said random 

genomic firagment is in the antisense orientation; and 

comparing the proliferation of said first microorganism transcribing a 
first level of said random genomic fragment to the proliferation of said first 
microorganism transcribing a lower level of said random genomic fragment, 
25 wherein a difference in proliferation indicates that said random genomic 

fragment comprises a gene involved in proliferation. 

70. The method of Claim 69, wherein said step of identifying a homolog of 
said gene in a second microorganism comprises identifying a homologous nucleic acid 
or a nucleic acid encoding a homologous polypeptide in a database using an algorithm 

30 selected from the group consisting of BLASTN version 2.0 with the default parameters 

and FASTA version 3.0t78 algorithm with the default parameters. 
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71. The method of Claim 69, wherein said step of identifying a homolog of 
said gene in a second microorganism comprises identifying a homologous nucleic acid 
or a nucleic acid encoding a homologous polypeptide by identifying nucleic acids which 
hybridize to said first gene. 
5 72. The method of Claim 69, wherein the step of identifying a homolog of 

said gene in a second microorganism comprises expressing a nucleic acid which inhibits 
the proliferation of said first microorganism in said second microorganism. 

73. The method of Claim 69, wherein said inhibitory nucleic acid is an 
antisense nucleic acid. 

10 74. The method of Claim 69, wherein said inhibitory nucleic acid comprises 

an antisense nucleic acid to a portion of said homolog. 

75. The method of Claim 69, wherein said inhibitory nucleic acid comprises 
an antisense nucleic acid to a portion of the operon encoding said homolog. 

76. The method of Claim 69, wherein the step of contacting the second 
15 microorganism with a proliferation-inhibiting amount of said nucleic acid sequence 

comprises directly contacting said second microorganism with said nucleic acid. 

77. The method of Claim 69, wherein the step of contacting the second 
microorganism with a proliferation-inhibiting amount of said nucleic acid sequence 
comprises expressing an antisense nucleic acid to said homolog in said second 

20 microorganism. 

78. A compound identified using the method of Claim 68. 

79. A method of assaying a compound for the ability to inhibit proliferation 
comprising: 

(a) identifying an inhibitory nucleic acid sequence which inhibits the 
25 activity of a gene or gene product required for proliferation in a first 

microorgansim; 

(b) contacting a second microorganism with a proliferation-inhibiting 
amount of said inhibitory nucleic acid, thus sensitizing said second 
microorganism; 

30 (c) contacting the proliferation-inhibited microorganism of step (b) 

with a compound; and 
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(d) determining whether said compound inhibits proliferation of said 
sensitized second microorganism to a greater extent than said compound inhibits 
proliferation of a nonsensitized second microorganism. 

80. The method of Claim 79, wherein said inhibitory nucleic acid is an 
5 antisense nucleic acid which inhibits the proliferation of said first microorganism. 

81. The method of Claim 79, wherein said inhibitory nucleic acid comprises 
a portion of an antisense nucleic acid which inhibits the proliferation of said first 
microorganism. 

82. The method of Claim 79, wherein said inhibitory nucleic acid comprises 
10 an antisense molecule against the entire coding region of the gene involved in 

proliferation of the first microorganism. 

83. The method of Claim 79, wherein said inhibitory nucleic acid comprises 
an antisense nucleic acid to a portion of the operon encoding the gene involved in 
proliferation of the first microorganism. 

15 84. A compound identified using the method of Claim 79. 

85. A method for assaying compounds for activity against a biological 
pathway required for proliferation comprising: 

sensitizing a cell by expressing an antisense nucleic acid against a 
nucleic acid encoding a gene product required for proliferation in a cell to 
20 reduce the activity or amount of said gene product; 

contacting the sensitized cell with a compound; and 
determining whether said compound inhibits the growth of said 
sensitized cell to a greater extent than said compound inhibits the growth of an 
nonsensitized cell. 

25 86. The method of Claim 85, wherein said cell is selected from the group 

consisting of bacterial cells, fimgal cells, plant cells, and animal ceils. 

87. The method of Claim 86, wherein said cell is an E. coli cell. 

88, The method of Claim 85, wherein said cell is from an organism selected 
from the group consisting of Staphylococcus aureus, Pseudomonas aeruginosa, 

30 Enterobacter cloacae, Helicobacter pylori, Neisseria gonorrhoeae, Enterococcus 

faecalis, Streptococcus pneumoniae, Haemophilus influenzae, Salmonella typhimurium, 
Saccharomyces cerevisiae, Candida albicans, Cryptococcus neoformans, Aspergillus 
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fumigatus, Klebsiella pneumoniae, Salmonella typhi, Salmonella paratyphi Salmonella 
cholerasuis, Staphylococcus epidermidis, Mycobacterium tuberculosis, Mycobacterium 
leprae, Treponema pallidum, Bacillus anthracis, Yersinia pestis, Clostridium 
botulinum, Campylobacter jejuni, and Chlamydia trachomatus, Chlamydia pneumoniae 
5 or any species falling within the genera of any of the above species. 

89. The method of Claim 85, wherein said antisense nucleic acid is 
transcribed from an inducible promoter. 

90. The method of Claim 89, further comprising contacting the cell with an 
agent which induces expression of said antisense nucleic acid from said inducible 

] 0 promoter, wherein said antisense nucleic acid is expressed at a sublethal level. 

91. The method of Claim 90, wherein said sublethal level of said antisense 
nucleic acid inhibits proliferation by 8% or more. 

92. The method of Claim 90, wherein said agent is isopropyl-l-thio-P-D- 
galactoside (IPTG). 

15 93. The method of Claim 91, wherein inhibition of proliferation is measured 

by monitoring the optical density of a liquid culture. 

94. The method of Claim 85, wherein said gene product comprises a 
polypeptide having a sequence selected from the group consisting of SEQ ID NOs: 243- 
357, 359-398. 

20 95. A compound identified using the method of Claim 85. 

96. A method for assaying a compound for the ability to inhibit cellular 
proliferation comprising: 

contacting a cell with an agent which reduces the activity or level of a 
gene product required for proliferation of said cell; 
25 contacting said cell with said compound; and 

determining whether said compound reduces proliferation to a greater 
extent than said compound reduces proliferation of cells which have not been 
contacted with said agent. 

97. The method of Claim 96, wherein said agent which reduces the activity 
30 or level of a gene product required for proliferation of said cell comprises an antisense 

nucleic acid to a gene or operon required for proliferation. 
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98. The method of Claim 96, wherein said agent which reduces the activity 
or level of a gene product required for proliferation of said cell comprises an antibiotic. 

99. The method of Claim 96, wherein said cell contains a temperature 
sensitive mutation which reduces the activity or level of said gene product required for 

5 proliferation of said cell. 

100. The method of Claim 99, wherein said antisense nucleic acid is directed 
against the nucleic acid encoding the same functional domain of said gene product 
required for proliferation of said cell to which said antisense nucleic acid is directed. 

101. The method of Claim 99, wherein said antisense nucleic acid is directed 
10 against the nucleic acid a different functional domain of said gene product required for 

proliferation of said cell than the functional domain to which said antisense nucleic acid 
is directed. 

102. A compound identified using the method of Claim 96. 

103. A method for identifying the pathway in which a proliferation-required 
15 nucleic acid or its gene product lies comprising: 

expressing a sublethal level of an antisense nucleic acid directed against said 
proliferation-required nucleic acid in a cell; 

contacting said cell with an antibiotic, wherein the biological pathway on which 
said antibiotic acts is known; and 
20 determining whether said cell has a substantially greater sensitivity to said 

antibiotic than a cell which does not express said sublethal level of said antisense 
nucleic acid. 

104. A method for determining the pathway on which a test compound acts 
comprising: 

25 (a) expressing a sublethal level of an antisense nucleic acid directed against a 

proliferation-required nucleic acid in a cell, wherein the biological pathway in which 
said proliferation-required nucleic acid lies is known, 

(b) contacting said cell with said test compound; and 

(c) determining whether said cell has a substantially greater sensitivity to said 
30 test compound than a cell which does not express said sublethal level of said antisense 

nucleic acid. 

105. The method of Claim 104, further comprising: 
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(d) expressing a sublethal level of a second antisense nucleic acid 
directed against a second proliferation-required nucleic acid in said cell, wherein 
said second proliferation-required nucleic acid is in a different biological 
pathway than said proliferation-required nucleic acid in step (a); and 

(e) determining whether said cell has a substantially greater sensitivity to 
said test compound than a cell which does not express said sublethal level of 
said second antisense nucleic acid. 

106. A purified or isolated nucleic acid consisting essentially of one of SEQ 
ID NOs: 358, 399-402. 

107. A compound identified using the method of Claim 23. 

108. A compound which interacts with the gene or gene product of a nucleic 
acid comprising a sequence of one of SEQ ID NOs: 82-88, 90-242 to inhibit 
proliferation. 

109. A compound which interacts with a polypeptide comprising one of SEQ 
ID NOs. 243-357, 359-398 to inhibit proliferation. 

110. A compound which interacts with a nucleic acid comprising one of SEQ 
ID NOs: 358, 399-402 to inhibit proliferation. 
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GENES IDENTIFIED AS REQUIRED FOR PROLIFERATION OF E. 

COLI 

Abstract of the Disclosure 
The sequences of nucleic acids encoding proteins required for E. coli 

5 proliferation are disclosed. The nucleic acids can be used to express proteins or portions 

thereof, to obtain antibodies capable of specifically binding to the expressed proteins, and 
to use those expressed proteins as a screen to isolate candidate molecules for rational drug 
discovery programs. The nucleic acids can also be used to screen for homologous genes 
that are required for proliferation in microorganisms other than coli. The nucleic acids 

10 can also be used to design expression vectors and secretion vectors. The nucleic acids of 
the present invention can also be used in various assay systems to screen for proliferation 
required genes in other organisms as well as to screen for antimicrobial agents. 
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SEQUENCE LISTING 



<110> Zyskind, Judith 

Ohlsen, Kari L. 
Trawick, John 
Forsyth, R. Allyn 
Froelich, Jamie M. 
Carr, Grant J. 
Yamaiuoto, Robert T* 
Xu, H, Howard 



<120> GENES IDENTIFIED AS REQUIRED FOR PROLIFERATION IN 
ESCHERICHIA COLI 

<130> ELITRA.OOIA 

<160> 485 

<170> FastSEQ for Windows Version 3.0 

<210> 1 
<211> 159 
<212> DNA 
<213> E. Coli 



<400> 1 

caggtggtat ggaaacccaa aatggagacg ggaagctgaa ccagatagtt actggaggtg 60 

atcaccagca gatgaaataa cgataaccag aacaacgcct tatagcgttg agtttgcgag 120 

aaaacgttca tattgtacct ttttgattaa ccattgggg 159 



<210> 2 
<211> 696 
<212> DNA 
<213> E. Coli 

<220> 

<221> misc_feature 
<222> (1) , . . (696) 
<223> n - A,T,C or G 



<400> 2 

gattacatca agcgcgcggt gggtttaccg ggcgataaag tcacttacga tccggtctca 60 

aaagagctga cgattcaacc gggatgcagt tccggccagg cgtgtgaaaa cgcgctgccg 120 

gtcacctact caaacgtgga accgagcgat ttcgttcaga ccttctcacg ccgtaatggt 180 

ggggaagcga ccagcggatt ctttgaagtg ccgaaaaacg aaaccaaaga aaatggaatt 240 

cgtctttccg agcgtaaaga gacactgggt gatgtgacgc accgcattct gacagtgccg 300 

attgcgcagg atcaggtggg gatgtattac cagcagccag ggcaacaact ggcaacctgg 360 

attgttcctc cgggacaata cttcatgatg ggcgacaacc gcgacaacag cgcggacagc 420 

cgttactggg gctttgtgcc ngaagcgaat ctggtcggtc nggcaacggc tatctggatg 480 

aacttcgata accaagaagg cgaatggccg aatggtctgc cctaantcgc attggcgnnt 540 

ccnttaatan ccacttcctt cnctttgtcc ccttatggca acacttaatt tattntaaan 600 

taatcncccg tggctnacaa atccccgcct tttnttaaaa atttccccna anttaaggtt 660 

ggcctccagt tgcccgnccc aaacactttg gncccc 696 



<210> 3 
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<211> 681 
<212> DNA 
<213> E. Coli 

<220> 

<221> inisc_f eature 
<222> (1) . . . (681) 
<223> n = A,T,C or G 



<400> 3 

ctgcagggta atgtcgccat taaactggcg caggcagcca aagagttgct ccgcttctac 60 

ccagtcggca gcgacaactt gcgttaaagt cgcaaaatta tcatctgcac tcactgcgtg 120 

acgtaagcgg atggagtggc cggaaacctc atagtgaccg cccaccagtt ggcctgcatc 180 

gctttgtagc gtacgcgcgg cattggcaat aagattcaga tactcagact cttccggggc 240 

cttcgccagc ataaaagagg aggatgctcg cgtatgcagc aactgctcca gcgcaaattg 300 

cagccgcggt tgagtatcac tgaataaagg atcgttttcg tcaatcaaat gtggctgagc 360 

aaatatttcc tgatagctat cggtatcagg aaccaggtca cgccatgcaa gtttcgtaat 420 

ggtcaaagtt gatgtttttt agtctgttgt caaagccgcn attataccng taaccggcac 4 80 

tacagcacac gtagaaagca cccgacaata ctcctggcat gggcgttaaa gctcacagga 540 

tggagatctt ttcttcactg gcctaaaaag ctgatattct gtaaagagtt acacngtaac 600 

attgagatcg ctatgaaata tcaacaactt ggaaaatctt gnaaagcngg ttggaaaatg 660 

gaaagtatct ggttaagaag c 681 



<210> 4 
<211> 289 
<212> DNA 
<213> E. Coli 



<400> 4 

ggcagaattt tacgctgacc aatgacgcga cgacgtggca tggaaatact ccgttgttaa 60 

ttcaggattg tccaaaactc tacgagttta gtttgacatt taagttaaaa cgtttggcct 120 

tacttaacgg agaaccatta agccttagga cgcttcacgc catacttgga acgagcctgc 180 

ttacggtctt taacgccgga gcagtcaagc gcaccacgta cggtgtggta acgaacaccc 240 

gggaggtctt taacacgacc gtcacggatc aggatcacgg agtgctcct 289 



<210> 5 
<211> 815 
<212> DNA 
<213> E. Coli 

<220> 

<221> misc_feature 
<222> (1) . . . (815) 
<223> n = A, T, C or G 



<400> 5 

gggagcttac atcagtaagt gaccgggatg agcgagcgaa gataacgcat ctgcggcgcg 60 

aaatatgaag ggggagagcc cttatagacc aggtagtaca cgtttggtta gggggcctgc 120 

atatggcccc ctttttcact tttatatctg tgcggtttaa tgccgggcag atcacatctc 180 

cgaggatttt agaatggctg aaattaccgc atccctggta aaagagctgc gtgagcgtac 24 0 

tggcgcaggc atgatggatt gcaaaaaagc actgactgaa gctaacggcg acatcgagct 300 

ggcaatcgaa aacatgcgta agtccggtgc tattaaagca gcgaaaaaag caggcaacgt 360 

tgctgctgac ggcgtgatca aaaccaaaat cgacggcaac tacggcatca ttctggaagt 420 

taactgccag actgacttcg ttgcaaaaga cgctggtttc caggcgttcg cagacaaagt 480 

tctggacgca gctgttgctg gcaaaatcac tgacgttgaa gttctgaaag cacagttcga 540 

agaagaacgt gttgcgctgg tagcgaaaat tggtgaaaac atcaacattc gccgcgttgc 600 

tgcgctggaa ggcgacgttc tgggttctta tcagcacggt gcgcgtatcg gccgttctgg 660 



-2- 



ttgctgctaa aagcgctgac gaagaactgg ttaaacacat cgttttgacc tttgttgcaa 
gccaagccag aattcagaga aactttccgc ttcaccggag gtcccaccca cangganccc 
cgattttntc agcatggtgg tcttcctncg gagtt 



720 
780 
815 



<210> 6 
<211> 403 
<212> DNA 
<213> E. Coli 

<400> 6 

caacactatt ttgttgaccg gaaaatggaa cactttccgc aatgcctgtt gctatcacgc 60 

ttaaaccatt tcattgcgat ttacacagaa cggacgtcct gtcgcagtat attaagtcgt 120 

cgatagaaac aagcattgaa aggcacagca gtagtcaaac agtgtgaaac gctactggcg 180 

ccttacagcg caaaaaggct ggtgactaaa aagtcaccag ccatcagcct gatttctcag 240 

gctgcaaccg gaagggttgg cttatttaac ttcaacttca gcgccagctt cttccagagc 300 

ttttttcagt gcttctgcgt cgtctttgct cacgccttct ttcagagcag ccggtgcaga 360 

ttctaccagg tctttagctt ctttcagacc caggccagtt gcg 403 

<210> 7 
<211> 149 
<212> DNA 
<213> E. Coli 

<400> 7 

gagctttttt cagtgcttct gcgtcgtctt tgctcacgcc ttctttcaga gcagccggtg 60 

cagattctac caggtcttta gcttctttca gacccaggcc agttgcgcca cgtactgctt 120 

tgataacagc aactttgtta gcgccagca 14 9 

<210> 8 
<211> 742 
<212> DNA 
<213> E. Coli 

<220> 

<221> misc_f eature 
<222> (1) . . . (742) 
<223> n = A,T,C or G 



<400> 8 

ccatctgtcc attgagcgga cagtttgtgc aacactattt tgttgaccgg aaaatggaac 60 

actttccgca atgcctgttg ctatcacgct taaaccattt cattgcgatt tacacagaac 120 

ggacgtcctg tcgcagtata ttaagtcgtc gatagaaaca agcattgaaa ggcacagcag 180 

tagtcaaaca gtgtgaaacg ctactggcgc cttacagcgc aaaaaggctg gtgactaaaa 240 

agtcaccagc catcagcctg atttctcagg ctgcaaccgg aagggttggc ttatttaact 300 

tcaacttcag cgccagcttc ttccagagct tttttcagtg cttctgcgtc gtctttgctc 360 

acgccttctt tcagagcagc cggtgcagat tctaccaggt ctttagcttc tttcagaccc 420 

aggccagttg cgccacgtac tgctttgata acagcaactt tgttagcgcc agcagctttc 480 

agaattacgt cgaattcagt tntttcttca gcagcttcaa ccgggccagc agctacagct 540 

acagcagcag caagcggaaa caccgaattt ttcttccatt gcagagatca gttctacaac 600 

cgtccattac agacatagct gcaactgctt caatgatttt gatctttagt ggatagacat 660 

ttaaattgtt cctgaattat caagaaataa gtnttatacg taagccgaaa tgcgttaaaa 720 

aagataactg ngattaaagc ag 742 



<210> 9 
<211> 421 
<212> DNA 
<213> E. Coli 



-3- 



<400> 9 

agtagtcaaa cagtgtgaaa cgctactggc gccttacagc gcaaaaaggc tggtgactaa 60 

aaagtcacca gccatcagcc tgatttctca ggctgcaacc ggaagggttg gcttatttaa 120 

cttcaacttc agcgccagct tcttccagag cttttttcag tgcttctgcg tcgtctttgc 180 

tcacgccttc tttcagagca gccggtgcag attctaccag gtctttagct tctttcagac 240 

ccaggccagt tgcgccacgt actgctttga taacagcaac tttgttagcg ccagcagctt 300 

tcagaattac gtcgaattca gttttttctt cagcagcttc aaccgggcca gcagctacag 360 

ctacagcagc agcagcggaa acaccgaatt tttcttccat tgcagagatc agttctacaa 420 

c 421 



<210> 10 
<211> 126 
<212> DNA 
<213> E. Coli 



<400> 10 

agagcttttt tcagtgcttc tgcgtcgtct ttgctcacgc cttctttcag agcagccggt 60 

gcagattcta ccaggtcttt agcttctttc agacccaggc cagttgcgcc acgtactgct 120 

ttgata 126 



<210> 11 
<211> 262 
<212> DNA 
<213> E. Coli 

<220> 

<221> misc_f eature 
<222> (1) . . . (262) 
<223> n = A,T,C or G 



<400> 11 

ctgcaaccgg aagggttggc ttatttaact tcaacttcag cgccagcttc ttccagagct 60 

tttttcagtg cttctgcgtc gtctttgctc acgccttctt tcagagcagc cgntgcagat 120 

tctaccaggt ctttagcttc tttcagaccc aggccagttg cgccacgtac tgctttgata 180 

acagcaactt tgttagcgcc agcagctttc agaattacgt cgaattcagt tttttcttca 240 

gcagcttcaa ccgggccagc ag 262 



<210> 12 
<211> 202 
<212> DNA 
<213> E. Coli 

<220> 

<221> misc_feature 
<222> (1) . . . (202) 
<223> n = A,T,C or G 



<400> 12 

gcgcataccc tgcagcatcg gcccgatgga gatcaggtcg gcagaacgct gtaccgcttt 60 

gtaggtggtg ttaccggtgn tcagatccgg gaagatgaac acggtagcgc gacctgcaac 120 

cggagagttc ggcgctttgg attncgcaac gtcagccatt accgcagcgt cgtactgcag 180 

cggaccggcg atcatcaggt ca 202 



<210> 13 
<211> 261 
<212> DNA 
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<213> E. Coli 



<400> 13 



tctaggagta agaatagctt caaattcagc 
cttttgcccg gcatgacgcc gggctttttt 
gcaaacttct cgccatcaaa tagcccctga 
agagaaagaa acgccatctg aataaacggc 
gcggctttca aggcgtcaat t 



agttgacagt ggcataaacg taactggtga 
tattattccg tgacttccag cgtagtgaag 
ctggttagtt ttagcgcggg gatcactggc 
tcatcgggta acggaccgca ttcacgggcg 



60 
120 
180 
240 
261 



<210> 14 
<211> 224 
<212> DNA 
<213> E. Coli 

<400> 14 

ttcttttttt cgtcaacggt gtccagaatc attttattta cctcggggta cttatgctga 60 

tttttattat tatggggaag gtgttattta tgagtttcat ttatgccgta acgacaatga 120 

actcgggaat tagtataagc agcgcgagaa taataatcat tgtgcaaatg ctaatttaat 180 

taatactatt taaatattat tttgagcata tgcacataag gttg 224 

<210> 15 
<211> 232 
<212> DNA 
<213> E. Coli 

<400> 15 

aattcccttc tttttttcgt caacggtgtc cagaatcatt ttatttacct cgggtactta 60 

tgctgatttt tattattatg gggaaggtgt tatttatgag tttcatttat gccgtaacga 120 

caatgaactc gggaattagt ataagcagcg cgagaataat aatcattgtg caaatgctaa 180 

tttaattaat actatttaaa tattattttg agcatatgca cataaggttg gg 232 

<210> 16 
<211> 212 
<212> DNA 
<213> E. Coli 

<400> 16 

aatagcgggt atgcacgcct ttcttttttt cgtcaacggt gtccagaatc attttattta 60 

cctcgggtac ttatgctgat ttttattatt atggggaagg tgttatttat gagtttcatt 120 

tatgccgtaa cgacaatgaa ctcgggaatt agtataagca gcgcgagaat aataatcatt 180 

gtgcaaatgc taatttaatt aatactattt aa 212 

<210> 17 
<211> 433 
<212> DNA 
<213> E. Coli 

<400> 17 

ccttgtaaat tatcgcccgt ggcataaaaa ctgcgtccaa acgccgtctt tgccagcagc 60 

caggccataa atgccaccag aattatcgtc aaccaaccaa ttgctgaaac gccaagcagc 120 

agcggggcgg agagctgttt cagttcggcg ggtaaccctt caatccattt gccgccagtc 180 

cacagcaaca tgatgcctct gtacaaccct aacgtgccaa gggtggcaac aatggcaggg 24 0 

atctttagcc acgcgaccag gacaccgttg aaaaatcccg cgagcaaacc aagcagtaaa 300 

gtcgcgacac aagcaacagg tagtgaatat cctgcgttca gtaacatccc caacagcacc 360 

gcgcacattc cggtaatcga acccactgaa acatcaatat tgcgcgtaag cattaccagc 420 

gtcgcgccca ttg 433 



-5- 



<210> 18 
<211> 658 
<212> DNA 
<213> E. Coli 



<400> 18 

cgtgcgcttc cggttgtggc aacccgcgaa atggcgcggc ggtaagtatg gcggggttat 60 

tccttccccg ttgaggacac cgggttgtca ggttgaccat acgcttaagt gacaaccccg 120 

ctgcaacgcc ctctgttatc aattttctgg tgacgtttgg cggtatcagt tttactccgt 180 

gactgctctg ccgccctttt taaagtgaat tttgtgatgt ggtgaatgcg gctgagcgca 24 0 

cgcggaacag ttaaaaccaa aaacagtgtt atgggtggat tctctgtatc cggcgttaat 300 

tgttaactgg ttaacgtcac ctggaggcac caggcactgc atcacaaaat tcattgttga 360 

ggacgcgata atgaaaacgt tattaccaaa cgttaatacg tctgaaggtt gttttgaaat 420 

tggtgtcact atcagtaacc cagtatttac tgaagatgcc attaacaaga gaaaacaaga 480 

acgggagcta ttaaataaaa tatgcattgt ttcaatgctg gctcgtttac gtctgatgcc 540 

aaaaggatgt gcacaatgaa ttcagcattt gtgcttgttc tgacagtttt tcttgtttcc 600 

ggagagccag ttgatattgc agtcaagtgg tcacaggaca atgcaggagt gtatgact 658 



<210> 19 
<211> 588 
<212> DNA 
<213> E. Coli 

<220> 

<221> misc_f eature 
<222> (1) . . . (588) 
<223> n = A,T,C or G 



<400> 19 

gtgactgctc tgccgccctt tttaaagtga attttgtgat gtggtgaatg cggctgagcg 60 

cacgcggaac agttaaaacc aaaaacagtg ttatgggtgg attctctgta tccggcgtta 120 

attgttaact ggttaacgtc acctggaggc accaggcact gcatcacaaa attcattgtt 180 

gaggacgcga taatgaaaac gttattacca aacgttaata cgtctgaagg ttgttttgaa 240 

attggtgtca ctatcagtaa cccagtattt actgaagatg ccattaacaa gagaaaacaa 300 

gaacgggagc tattaaataa aatatgcatt gtttcaatgc tggctcgttt acgtctgatg 360 

ccaaaaggat gtgcacaatg aattcagcat ttgtgcttgt tctgacagtt tttcttgttt 420 

ccggagagcc agttgatatt gcagtcagtg ttcacaggac aatgcangag tgtatgactg 480 

cagcaacccg aacagaaaat tcccggtaac tgttacccgg tcgataaagt tattcaccag 540 

gataatatcg aaatcccggc aggtctttaa aacagttccg taataaat 588 



<210> 20 

<211> 101 

<212> DNA 

<213> E. Coli 

<400> 20 

gatccagcaa gaagatgcgg ttgtaccgtc atcacgcaga tgcgcaaagc tactcagcaa 60 
ctgacctttc ttcgcaataa gcacgccatt agcgtcatag a 101 

<210> 21 
<211> 465 
<212> DNA 
<213> E. Coli 

<400> 21 

tcgcgtgttt accttcaaca tcggtaactt tctggcggat agtttcacgg taagcaacct 60 
gcggtttacc tacgttcgct tcaacgttga attcacgctt catacggtca acgatgatgt 120 
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cgaggtgcag 
cacggaaaga 
cagctttggt 
ccagaatgat 
cgatagcagc 
gcatctgaac 



ttcgcccata 
cgggtcttct 
tttcggttca 
cggcgcatcc 
agcgatgtcg 
gatacgaccg 



cccgcgatga 
ttagccagac 
actgcgatgg 
gggtcacaca 
cccgcgcgaa 
aaacgctcac 



tggtctggtt 
ggcccagagc 
agattaccgg 
gggtgtcacc 
cttctttgat 
gtgcagcttt 



agattcttcg 
cagacccatt 
ctcagggaat 
agtggttacg 
ctcttcacgt 
cacgg 



tcagtccata 
ttttcctggt 
tccatacgtt 
tctttcagac 
ttgttagcgt 



<210> 22 
<211> 859 
<212> DNA 
<213> Coli 

<220> 

<221> misc_feature 
<222> (1) . . . (859) 
<223> n = A,T,C or G 



<400> 
tgatcggctc 
cagccagttt 
gaatacccat 
tacctttatc 
tgaactcgta 
gaccatactg 
tctggcggat 
attcacgctt 
tggtctggtt 
gggccanagc 
gattaccggc 
anggngtacc 
gccnaacttc 
aaaagngtta 
cggggtnaac 



22 

aagcagaact 
aaacgccagt 
gtctactacc 
aacggccggg 
gcctttcggg 
accacgacca 
agtttcacgg 
catacggtca 
agattcttcg 
cagacccatt 
tcanggaatt 
aggggggtac 
tttggaacnn 
anngccantt 
ccngnaaaa 



ggtttcgctt 
tcagaggagt 
gggtagcctg 
atgtattcgc 
tttgaacccg 
ccagactgtt 
taagcaacct 
acgatgatgt 
tcagtccata 
ttttcctggt 
tccatacctt 
ntntttttaa 
tttaccggtt 
ttccnggngt 



tcttaaagcc 
caacgtcatg 
ccagcggacc 
cagggattac 
gctccagcgg 
tcgcgtgttt 
gcggtttacc 
cgaggtgcag 
cacggaaaga 
cagctttggt 
ccaggaatga 
nancgattgc 
ggtaaccngc 
tnanntncgg 



ttctttaaag 
gtaagaaccg 
tgctttcagc 
accaccttta 
gtacatgtcg 
accttcaaca 
tacgttcgct 
ttcgcccata 
cgggtcttct 
tttcggtcaa 
tcggcgcatt 
cagcancgga 
cttttnaacn 
nttcccngaa 



gcgatagaag 
aagtgcagac 
tgttcctgga 
atgtcgttga 
ataacaacat 
tcggtaactt 
tcaacgttga 
cccgcgatga 
ttagccagac 
ctgcgatgga 
ccggtcaaac 
tntnncccgn 
atccaaccga 
ntaacccncc 



<210> 23 
<211> 269 
<212> DNA 
<213> E. Coli 



180 
240 
300 
360 
420 
465 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
859 



<400> 23 

ctttcttaaa gccttcttta aaggcgatag aagcagccag tttaaacgcc agttcagagg 

agtcaacgtc atggtaagaa ccgaagtgca gacgaatacc catgtctact accgggtagc 

ctgccagcgg acctgctttc agctgttcct ggataccttt atcaacggcc gggatgtatt 

cgccagggat tacaccacct ttaatgtcgt tgatgaactc gtagcctttc gggtttgaac 
ccggctccag cgggtacatg tcgataaca 



60 
120 
180 
240 
269 



<210> 24 
<211> 330 
<212> DNA 
<213> E. Coli 



<400> 24 

gttttgggga gatgtaaggg ctaatctgaa tggctgcatt ccttgtttaa ggaaaaacga 60 

atgactgatt gccgatacct gattaaacgg gtcatcaaaa tcatcattgc tgttttacag 120 

ctgatccttc tgttcttata acacaaggaa acgtacttaa ggtgcgtccg gtgaaccagt 180 

cggacgcacc tttaataact ataaataagt gtctgggcag atactatata aattaactta 240 

gtgaatgatt atgctaatgt catcaattaa ataaatataa tggcgttaag gcttcccagt 300 
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aatataatta atactctact tccagagtag 



330 



<210> 25 
<211> 471 
<212> DNA 
<213> E. Coli 

<220> 

<221> misc_f eature 
<222> (1) . . . (471) 
<223> n = A,T,C or G 



<400> 25 

gttttgggga gatgtaaggg ctaatctgaa tggctgcatt ccttgtttaa ggaaaaacga 60 

atgactgatt gccgatacct gattaaacgg gtcatcaaaa tcatcattgc tgttttacag 120 

ctgatccttc tgttcttata acacaaggaa acgtacttaa ggtgccgtcc ggtgaaccag 180 

tcggacgcac ctttaataac tataaataag tgtctgggca gatactatat aaattaactt 240 

agtgaatgat tatgctaatg tcatcaatta aataaatata atggcgttaa ggcttcccag 300 

taatataatt aatactctac ttccagagta gaatattaaa ttttatccgc gtggtgcatc 360 

agcacaaatt tatcccacaa ctgttcttct gtctcgacat gccccccgat ctttnacaaa 420 

tantattggg ggattnggcc cncctttttg ncaggttggg gtcntctnat g 471 



<210> 26 
<211> 379 
<212> DNA 
<213> E. Coli 

<220> 

<221> mi sc__f eature 
<222> (1) . . . (379) 
<223> n = A,T,C or G 



<400> 26 

natctgantg gctgcattcc ttgtttaagg aaacccgaat gactgattgc cgatacctga 60 

ttaaacgggt catcaaaatc atcattgctg ttttacagct gatccttctg ttcttataac 120 

acaaggaaac gtacttaagg tgcgtccggt gaaccagtcg gacgcacctt taataactat 180 

aaataagtgt ctgggcagat actatataaa ttaacttagt gaatgattat gctaatgtca 240 

tcaattaaat aaatataatg gcgttaaggc ttcccagtaa tataattaat actctacttc 300 

cagagtagaa tattaaattt tatccgcgtg gtgcatcagc acaaatttat cccacaactg 360 

ttcttctgtc tcgacatgc 379 



<210> 27 
<211> 799 
<212> DNA 
<213> E. Coli 



<400> 27 

aaagatgatg tgatgagaaa gtcaatttga ataagacaat attaagagct aaaaaaatgt 60 

caaaaaacac taaatcaaaa aataatggca ttagaaaata taatgcgaaa acggaggtga 120 

aattagttta tttcaaatga ggaaaatctc ccggcgaaaa aaccgggaga tgaaagtgtg 180 

atgggtatca aataaacaac agaggagaaa tttttaacgc agccattcag gcaaatcgtt 240 

taatcccatt gcctggcgga taagttgcgg cttaacgcca ggaagcgtgt cggccagttt 300 

caaaccaata tcacgcagca gttttttcgc cggattggta ccggaaaaca gatcgcggaa 360 

tccctgcata ccagccagca tcaacgccgc actgtgcttg cggctacgct catagcgacg 420 

cagataaatg tactgcccga tgtctgggat ccgtcgacct gcagccaagc ttgggctttt 4 80 

cagcctgata cagattaaat cagaacgcag aagcggtctg ataaaacaga atttgcctgg 540 

cggcagtagc gcggtggtcc cacctgaccc catgccgaac tcagaagtga aacgcccgta 600 
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gcgcccgatg gtagtgtggg gtctccccat gcgagagtag ggaactgcca ggcatcaaat 
aaaacgaaag gctcagtcga aagactgggc ctttcggttt atctggtggt tgtcggtgaa 
cgctctctga gtaggacaaa tccgccggga gcggattttg aacgttgcga aacaaccggc 
ccggaaaggg gtgggggct 

<210> 28 
<211> 636 
<212> DNA 
<213> E. Coli 



<220> 

<221> misc_feature 
<222> (1) . . . (636) 
<223> n - A,T,C or G 



<400> 28 

agggggtttg ttgtgggcaa tgatgcattt aagttatcgt ctgcagatag aggagatatt 60 

acaataaaca acgaatcagg gcatttgata gtcaataccg caattctatc aggagatata 120 

gtcactctaa gaggaggaga aattaggttg gtattatagc ttgtgcgcgc catgattggc 180 

gcgcaattta aacttagtgc tttacatcgc tattgtcttg atttctttga attattttat 240 

aaattaaaaa aacgactgtt atgtataagc aaaggtcgaa cgaaaaatac attccaaata 300 

aatgcttgct taaatctcta tatccttccc cgaaaaatga cacataaaat tgagatattc 360 

caaaaagaga tactacaaat aaagatgcct ttattttatt atttctaata aaaatagaag 420 

caataaaaaa taataacaat gatataaatc taatgttttt aaatatattg tcttttatgt 480 

tagtaatagt cgttagtatg tttgattctc catatattac gtgtagtttt ttatatacat 540 

ggaaataatt ntctttatac tgagacatca caccatcatc aaatggaagt ttgaagatgg 600 

tgcttggttt gctaaccaat aaaaagagtg cattcg 636 



<210> 29 
<211> 757 
<212> DNA 
<213> E. Coli 



<220> 

<221> misc_f eature 
<222> (1) . . . (757) 
<223> n = A, T,C or G 



<400> 29 

cagcggtcgt atttttagca tggtttttta ttggcggcta tgctgccccg ggagcataaa 60 

gatgaaaaaa acaacgatta ttatgatggg tgtggcgatt attgtcgtac tcggcactga 120 

gctgggatgg tggtaacgtc acctctaaaa aatagcaaag gctgcctgtg tgcagccttt 180 

gtgcaattta agcgttaact tttaatcttc ctgtagataa atagcacgac aatcgcacca 240 

ataacggcaa ccacgaagct gccaaaattg aagccatcga ctttaccaaa gccaaacagc 300 

gtgctgatcc atccgccgac tacggcaccg actatcccca gcaggatagt cataaagaat 360 

ccacctccat ctttacctgg catgatccac ttcgccagaa taccggcaat aagcccaaaa 420 

ataatccatg acagaatgcc cattgtttcc tcacttatct gttttgcatt agcgggttag 480 

tcgctgataa aaagcatagc acaacatcgg gagggcaaga tttgtgacga gcatcacgga 540 

ggtttttttt gcgatggcgc agaaattgcg ccatcaacga tcagtgataa ttaccaacca 600 

caaacatcat gttcgttttc cgtgtcataa gaaccgtacg ggattcacca gatcttttat 660 

cacttcaagc cggcacttct ggcaccagca aagtcatcgg cgtctctggt tcataatcga 720 

ccggaaacgc cattgctggt attggtgacn gtcacgg 757 



<210> 30 
<211> 392 
<212> DNA 
<213> E. Coli 



-9- 



<400> 30 

aattacagaa aaaggaggca atatcgggta aaggcattag cccgacgaat acgtcgggct 60 

acaaatatta ttgtgctgca ggtgttttag cgggttgttg atccacaggt tctaactgga 120 

agaccacatc gacctgatca tcaaactgaa tagcggcctg ctcgtaagtt tcctgggcgg 180 

acaccggcgc ggcatcggct ttcatcatcc gcaccattgg gctgggctga tagttggaaa 240 

catggtagcg cacgctatat accggcccca gtttacgatg aaagccgttc gccagttcct 300 

gcgcctgatg aatcgcgtta tcaatcgctg ccttacgcgc tttgtcttta taggcatccg 360 

gctgcgccac gcccagcgac acagaacgaa tt 392 



<210> 31 
<211> 351 
<212> DNA 
<213> E. Coli 



<400> 31 

ctatccttga tgaaaccgcg agcaaagata ggtgattacg tcatggtttt acagaaaatt 60 

acagaaaaag gaggcaatat cgggtaaagg cattagcccg acgaatacgt cgggctacaa 120 

atattattgt gctgcaggtg ttttagcggg ttgttgatcc acaggttcta actggaagac 180 

cacatcgacc tgatcatcaa actgaatagc ggcctgctcg taagtttcct gggcggacac 240 

cggcgcggca tcggctttca tcatccgcac cattgggctg ggctgatagt tggaaacatg 300 

gtagcgcacg ctatataccg gccccagttt acgatgaaag ccgttcgcca g 351 



<210> 32 
<211> 762 
<212> DNA 
<213> E. Coli 

<220> 

<221> misc__f eature 
<222> (1) . , . (762) 
<223> n = A,T,C or G 



<400> 32 

aattatgaaa cactgtctgg aatcgtctga atgacgggca catttgcgag cacgcatcca 60 

gtaataacac aggaaactat tttatctacg cgttagcgat agactgcttg catggcgaaa 120 

ggaggtaagc cgacgatttc agcgggacgc tgaaacggga aagcccctcc cgaggaaggg 180 

gccataaata aggaaagggt catgatgaag ctactcatca tcgtggtgct cttagtcata 240 

agcttccccg cttactaaga ctaccagggc gggggaaacc ccgctctacc ctcactcctg 300 

aaagtatgcc ttcacgataa gattgtcaat ccgcaggctt tgtagtctgc gatcctgcca 360 

gcaaatattc tttgcgagtc gttacgcaat aatcacagag gaaactattt tattcacgcg 420 

ttagcgatag actgcattca gggcgaaagg aggtaagccg atgatttcag cgggacgctg 480 

aaacgggaaa gcctctcccg gagaagaggg cttttaataa ggaaagggtt atgatgaagc 54 0 

acgtcatcat actggtgata ctcttagtga ttagcttcca ggcttactaa gaacaccagg 600 

gggaggggga aacctcttcc taaccctcac ttctgaaatt gggtgctatg acgctggcgt 660 

tactgcttan cgctaccagt ttgtctgccc tggcggttgt aacgccagat cggtacccgt 720 

ttggatattt taatgaaagc cgacaaatca atcancgtga eg 7 62 



<210> 33 
<211> 293 
<212> DNA 
<213> E. Coli 



<400> 33 

gcacatttgc gagcacgcat ccagtaataa cacaggaaac tattttatct acgcgttagc 60 

gatagactgc ttgcatggcg aaaggaggta agccgacgat ttcagcggga cgctgaaacg 120 

ggaaagcccc tcccgaggaa ggggccataa ataaggaaag ggtcatgatg aagctactca 180 
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tcatcgtggt gctcttagtc ataagcttcc ccgcttacta agactaccag ggcgggggaa 
accccgctct accctcactc ctgaaagtat gccttcacga taagattgtc aat 



240 
293 



<210> 34 
<211> 633 
<212> DNA 
<213> E. Coli 

<220> 

<221> misc_feature 
<222> (1) . . . (633) 
<223> n = A,T,C or G 

<400> 34 

atttacactt tttacgaaat catgggatca ctaacaaaat atcgcttgtc agttatattg 60 

tatggcagga aagatatgcg actgatatta cagatcccca aagtggagag tttatgacca 120 

ttaaaaataa gatgttgctg ggtgcgcttt tgctggttac cagtgccgcc tgggccgcac 180 

cagccaccgc gggttcgacc aatacctcgg gaatttctaa gtatgagtta agtagtttca 240 

ttgctgactt taagcatttc aaaccagggg acaccgtacc agaaatgtac cgtaccgatg 300 

agtacaacat taagcagtgg cagttgcgta acctgcccgc gcctgatgcc gggacgcact 360 

ggacctatat gggtggcgcg tacgtgttga tcagcgacac cgacggtaaa atcattaaag 420 

cctacgacgg tgagattttt tatcatcgct aaaaaaagcc ccctcatcat gagggggaaa 480 

tgcagacacc ttgntatttt ttattattag ccacttgctc gtcttgcttg gtattaagtc 540 

gtatttcacg ttgattaatg cnggtggctc cagtgcgcca gattaacttt gtttggatcg 600 

aagacgtagt aactggctgg ttatcggaat tgg 633 

<210> 35 
<211> 569 
<212> DNA 
<213> E. Coli 

<400> 35 

tatggcagga aagatatgcg actgatatta cagatcccca aagtggagag tttatgacca 60 

ttaaaaataa gatgttgctg ggtgcgcttt tgctggttac cagtgccgcc tgggccgcac 120 

cagccaccgc gggttcgacc aatacctcgg gaatttctaa gtatgagtta agtagtttca 180 

ttgctgactt taagcatttc aaaccagggg acaccgtacc agaaatgtac cgtaccgatg 24 0 

agtacaacat taagcagtgg cagttgcgta acctgcccgc gcctgatgcc gggacgcact 300 

ggacctatat gggtggcgcg tacgtgttga tcagcgacac cgacggtaaa atcattaaag 360 

cctacgacgg tgagattttt tatcatcgct aaaaaaagcc ccctcatcat gagggggaaa 420 

tgcagacacc ttgttatttt ttattattag ccacttgctc gtcttgcttg ttattagtcg 480 

tatttcacgt tgattaatgc ggttgcctcc agtgcgccag atttaacttt gtttgtatcg 540 

tagacgtagt aactggctgg tatcggaat 569 

<210> 36 
<211> 338 
<212> DNA 
<213> E. Coli 

<400> 36 

cgtattcaca tccttttgat tggtgataac atgcgaatcg gtattatttt tccggttgta 60 

atcttcatta cagcggtcgt atttttagca tggtttttta ttggcggcta tgctgccccg 120 

ggagcataaa gatgaaaaaa acaacgatta ttatgatggg tgtggcgatt attgtcgtac 180 

tcggcactgc ctgggatggt ggtaacgtca cctctaaaaa atagcaaagg ctgcctgtgt 240 

gcagcctttg tgcaatttaa gcgttaactt ttaatcttcc tgtagataaa tagcacgaca 300 

atcgcaccaa taacggcaac cacgaagctg ccaaaatt 338 

<210> 37 
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<211> 375 
<212> DNA 
<213> E. Coli 



<400> 37 



ctgaatattt aaaaaggaaa 
ataaaatatt tccgtggagc 
caaagaagca ttgaatgcag 
ttttaattta acatgtaaac 
atacataaat ggagtcatgt 
tccatctcta attgcatatt 



acgacatgaa accgaagcac agaatcaaca ttctccaatc 
attttattat tgaatataga ggtttaactc cggtaaaaaa 
ggaaaaataa tatggccata aaaaacatcg aaagaaactc 
gcatggttaa tcctcatatc acgggtggag tgttaagaac 
tttccctttt ccatttatca agttcctgtt gccgttttag 
ttaatttttc tgataaatgg cattgagcat cgatttcatt 



60 
120 
180 
240 
300 
360 
375 



taaaacaact gtaca -^/-^ 

<210> 38 
<211> 446 
<212> DNA 
<213> E. Coli 

<400> 38 

ttacgatagc tattagtaaa aatataagag ttagctgtat tgttatgtct gtggcgaaat 60 

tgactacctt cgtttttttg attaagaatg attttattat cgtaagtaaa attacatgaa 120 

tatttaaaaa ggaaaacgac atgaaaccga agcacagaat caacattctc caatcataaa 180 

atatttccgt ggagcatttt attattgaat atagaggttt aactccggta aaaaacaaag 240 

aagcattgaa tgcagggaaa aataatatgg ccataaaaaa catcgaaaga aactctttta 300 

atttaacatg taaacgcatg gttaatcctc atatcacggg tggagtgtta agaacataca 360 

taaatggagt catgttttcc cttttccatt tatcaagttc ctgttgccgt tttagtccat 420 

ctctaattgc atattttaat ttttct 446 

<210> 39 

<211> 392 

<212> DNA 

<213> E. Coli 

<220> 

<221> misc_f eature 

<222> (1) . . . (392) 

<223> n = A,T,C or G 

<400> 39 

tcaccccggt gccgattttc aggcatcctg atttaactta gcacccgcaa cttaactaca 60 

ggaaaacaaa gagataaatg tctaatcctg atgcaaatcg agccgatttt ttaatcttta 120 

cggactttta cccgcctggt ttattaattg cactgtnatc cgggcgttcg cccgctttaa 180 

tcacaatagg ctgtgtagcc tgggcctgtt tctctttcac ccgcgccaga gcggcagcaa 240 

tcgcatcttt atctttggct gcaggttgaa cggctgcgct cttatgtcgt tcaaggcgag 300 

ccgctttttc gcgctccaga cgagcctggc gcgcttcgaa acgcgctttg gcttctgcgg 360 

cncgcttttc ttcctgacga atagccgcaa tt 392 

<210> 40 
<211> 208 
<212> DNA 
<213> E. Coli 

<400> 40 

taataacgct atctgcggat aaagcagaat aggtggttaa ccccagacat aaaccgagga 60 

aaataatgtt attgtatttc ataatctatt gttccttagc gacagattgc tgtctgctgg 120 

ttcagtaagg taccaggaga aacttcagga agcttgtact cgacaataca gtttgagttt 180 

ttatctttgc cccatgaaac ctgtaatt 208 
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<210> 41 
<211> 342 
<212> DNA 
<213> E. Coli 



<400> 41 



catcctcaat 
acgtaatctg 
tgtttttttg 
cgaggacgcg 
accccagaca 
cgacagattg 



accgttaaat 
aaaagggacg 
gagatctacg 
cccgctaatt 
taaaccgagg 
ctgtctgctg 



gcaacccgaa 
gctggacttg 
gtaaaattaa 
ttaataacgc 
aaaataatgt 
gttcagtaag 



cccccgttgt 
tgctaccggt 
gcgaatccga 
tatctgcgga 
tattgtattt 
gtaccaggag 



ccctttgctg 
cgttggaaat 
tgagactgtg 
taaagcagaa 
cataatctat 
aa 



cattcactta 
tgtctggcac 
cagccataat 
taggtggtta 
tgttccttag 



60 
120 
180 
240 
300 
342 



<210> 42 
<211> 841 
<212> DNA 
<213> E. Coli 



<220> 

<221> misc_f eature 
<222> (1) . . . (841) 
<223> n = A,T,C or G 



<400> 42 

agatttactg ccaatttccg gcagatcgga aagggttaaa ccatattgat ccataagggt 60 

acgaatcacg gctataccgc caggcatggc ttgagccatg gcattaaatt ccgcaaattc 120 

gggcgctgat tcttcccacg cggttatttt ggcacacacc agatccagca aggggttntc 180 

aggatcgttg agcagcagat gatctaccag ttncagcgcc tgggtgtatt gntccttgtt 240 

ctgaataccc gnnagaaaag gtgccacagc anttagcttn tctcctgctt gcaagatgtc 300 

tggcaatngc aatcattttt tgcacttant acgatgnaca ncngtaaaga aatcgnattt 360 

ttntatgccg tcataacttt acgtatgtan cactttttgc nattcnaaaa aagaccattn 420 

gctncaacac gtaaatttna ttgnccccna catttanaac ataaatgntt aaaattttcc 480 

ccccncnnan ttttaagntn ttnanagaat ngggaattac ctgcttttna atgnactcan 540 

anttttttng naataattcc tntatcnaan ctnnttttcn cccaanagnc nnccaaattn 600 

cggtttnntn nttnncnngg cnttttttta cccnanaann tttattcaan nccttttttg 660 

tagnctattt naagnggnct ttnttnnatt aactttccnn ttggncaaat tttggcnnat 720 

ttttatatan aattntctta tntcntaatt tnggnanccc cngatgnaan tttatggngg 780 

gantcccnnt ccctntttaa tnnatgntct gggntatttt taaancctnn attaannnan 840 

c 841 



<210> 43 
<211> 215 
<212> DNA 
<213> E. Coli 



<400> 43 

aataactttt cgttaggcag ttttgggtgt gagttgcaag aggggagact actgaataac 60 

tcaagtttta taatcgaggg gaaaatggtg atggcgttca tagcaaaacg ccctcaacca 120 

taaaggtcga gggcgcttaa gatgttaaaa acccgctatc cgttaaaaaa caatgttcaa 180 

ctaaggtcag tgacattgcg ctaaaaaagc gaatt 215 

<210> 44 
<211> 395 
<212> DNA 
<213> E. Coli 
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<220> 

<221> misc_feature 
<222> (1) . . . (395) 
<223> n = A,T,C or G 



<400> 44 

gcattattca tgagaaatgt gtatcgtaaa tcaactgaaa ttaacgcaac catttgttat 60 

ttaaggttta attatctgtg tgtgatattt tattgaatgt tttaaatatt gtttttattg 120 

gcattgctat aatattggtt atcatttgct gaatggattc agtcttaatg agtgggtttt 180 

taagggacag gcatagagta atgatacgta tgcataacca acatctttac tcattatgtc 240 

attgaatgtt gaccctatgt gtttatgaag gagaggtatt ttcagttgat ctggattgnt 300 

aaattcatat aatgcgcctt tgctcatgaa tggatgccag tatgtagtgg gaaattataa 360 

atattgaaat agtccaacta cttctttatt accaa 395 



<210> 45 
<211> 883 
<212> DNA 
<213> E. Coli 

<220> 

<221> misc_feature 
<222> (1) . . . (883) 
<223> n = A,T,C or G 



<400> 45 

ataatcaggt aagaaaaggt gcgcggagat taccgtgtgt tgcgatatat tttttagttt 60 

cgcgtggcaa tacatcagtg gcaataaaac gacatatcca gaaaaatata cactaagtga 120 

atgatatctt ccgatttatc ttaatcgttt atggataacg gcaaagggct tcgttttttc 180 

ctatacttat tcagcactca caaataaagg aacgccaatg aaaattatac tctgggctgt 240 

attgattatt ttcctgattg ggctactggt ggtgactggc gtatttaaga tgatatttta 300 

aaattaatta atgtcatcag gtccgaaaat aacgagaata tttcagtctc tcatcctgtt 360 

gcgctcctgt catgtgcatt gcttcatata atcactggcg caaggagcgc cgcaggcgna 420 

gnntgcncgn cgncccacct naccccatgc cgaacttcag aantgaaaac nccntaacnc 480 

cgatngtcgg cgggngcctc cccatgcnan agtangggaa ntgccangcg ncnnattaaa 54 0 

cgaaaggctn attncaaaga ctgggccttn cntttatctg atgtttgtcg gagaacgctc 600 

tcctgagnan gacaaatncc gccgggagcg gatttgaacn ttgcgaagca accgncccna 660 

agggngnngt cntgacnccc nnctctanct nncngccttc ttttgcttna angncctcct 720 

ancngatggc ctttttngcc ntctaccaaa cnntttggtt aatgcttnta aaancctttc 780 

cannntncaa tccngtnntn cccatccnnn tnntgaaagn ntncctnccn tgtncantnt 84 0 

anntnngggg gnngngngcc ggcggncccc cccccccccc ccc 883 



<210> 46 
<211> 1024 
<212> DNA 
<213> E. Coli 

<220> 

<221> inisc__feature 
<222> (1) . . . (1024) 
<223> n - A,T,C or G 



<400> 46 

gtttatggat aacggcaaag ggcttcgttt tttcctatac ttattcagca ctcacaaata 60 

aaggaacgcc aatgaaaatt atactctggg ctgtattgat tattttcctg attgggctac 120 

tggtggtgac tggcgtattt aagatgatat tttaaaatta attaatgtca tcaggtccga 180 

aaataacgag aatatttcag tctctcatcc tgttgcgctc ctgtcatgtg cattgcttca 240 

tataatcact ggcgcaagga gcgcgcagag tnctccnant nnnnntnntt ntntnnctnn 300 
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nccttcacna tncnnccncn nantnnatag nncaccnntn ttnntcnnnn gnccncctcc 360 

nnncnnnnnn ncatnnnatc ccactnnntt tnctccannn nnncnnnntn canccnacaa 420 

antncnaccn anntnacctt atacnnannc nancnnnnnn nnccactctn nctcgnnctc 480 

cccnttcnac nnccannnnn cancnntcnn ctnnnnccct nncntaattn ttctnnctan 54 0 

ntcctanccn cnnacnnncc cancnatccn nnnatacant cnattnntnn cnntcncntn 600 

cnccnnttcc nnctnnncnc tnccncatnc ccnnnannan canntncccc ncctncctna 660 

ccncncncnc ccnccatccc nnnccnncnt ccnnantnga caannnnaat cncnnnnncn 720 

nnnnnncnnn tnnncncccn gcncnnccnt nccntcacnc tnnncnncta nannnnntac 7 80 

nntnaccnnt cctnncacnc tnccctnnng antccnacna ntnnnnnanc nanaacnctn 840 

tnnnnccata atcccacacc acncccntnc ancntntnnt ncntcntccc ttcntatcnc 900 

agctnnnnnt nctntnnnnc tnccncccnn cnnactncnn nnacccncnn cccantcagt 960 

ccaccntccn cnncnnnntn nnncnancan ctnncacncn cnantaacct nntnncacct 1020 

tccc 1024 

<210> 47 
<211> 236 
<212> DNA 
<213> E. Coli 

<400> 47 

atatacacta agtgaatgat atcttccgat ttatcttaat cgtttatgga taacggcaaa 60 

gggcttcgtt ttttcctata cttattcagc actcacaaat aaaggaacgc caatgaaaat 120 

tatactctgg gctgtattga ttattttcct gattgggcta ctggtggtga ctggcgtatt 180 

taagatgata ttttaaaatt aattaatgtc atcaggtccg aaaataacga gaatat 236 

<210> 48 

<211> 418 

<212> DNA 

<213> E. Coli 

<220> 

<221> misc_f eature 
<222> (1) . . . (418) 
<223> n = A,T,C or G 



<400> 48 

cggagattac cgtgtgttgc gatatatttt ttagtttcgc gtggcaatac atcagtggca 60 

ataaaacgac atatccagaa aaatatacac taagtgaatg atatcttccg attnatctta 120 

ntcgtttatg gataacggca aagggcttcg ttttttccta tacttattca gcactcacaa 180 

ataaaggaac gccaatgaaa attatactct gggctgtatt gattattttc ctgattgggc 240 

tactggtggt gactggcgta tttaagatga tattttaaaa ttaattaatg tcatcaggtc 300 

cgaaaataac gagaatattt cagtctctca tcctgttgcg ctcctgtcat gtgcattgct 360 

tcatataatc actggcgcaa ggagcgcgca nggggcggcc aatcgccgcc ggcccctg 418 



<210> 49 
<211> 550 
<212> DNA 
<213> E. Coli 



<400> 49 

ctgctagtta cagggaacac taatgacaga cagctaaaag ccctgtttaa ttacgtatta 60 

caaacagggg atgcccagcg ttttcgtgca tttattggtg agatagcgga acgcgcacca 120 

caagaaaagg agaaactgat gaccattgct gacagattac gtgaagaagg cgcaatgcag 180 

ggcaaacacg aagaagccct gcgtattgct caggagatgc tggatagagg tttagacaga 240 

gagttagtta tgatggtgac ccgactttca ccagacgatc ttatcgcgca aagccactaa 300 

tcctgtaaca ccgggagtta actggcggat gtttgctgta aaccacatca gcgaacgaca 360 

tccgccagcg cctcttctaa atcgtaccag cgaaacgcaa aacccgcttc ttccagccgt 420 
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ttaggcagcg cgcgttgtcc acctaatacc agtactgaag attcgcccat taacagtcga 
atggcggtcg cggggacgcg caaaatggcc gggcgatgca gcgcatgacc gagcgcatgg 
gcaaattgtt 



480 
540 
550 



<210> 50 
<211> 99 
<212> DNA 
<213> E. Coli 



<400> 50 

ttggcatctc ggtgttgccg atcttcatga tatccagccc gccggaaact tcttcccaaa 60 
cggttttgct gttatccatt gagtcacgga actgcccct 99 

<210> 51 
<211> 259 
<212> DNA 
<213> E. Coli 



<220> 

<221> misc_feature 
<222> (1) . . . (259) 
<223> n - A, T,C or G 



<400> 51 

ccgtgccgag atgatcctgt naccatcatc cgttgtgaag tagtgattca cgacttcaag 

gcgcttttca aaagggtatt ttggctttga catattaggg gctattccat ttcatcgncc 

aacaaaatgg gtgcagtaca tactcnttgg aaatcaacac aggaggctgg gaatgccgca 

gaaatataga ttactttctt taatagtgat ntgtttcacg cttttatttt tnaaanaagt 
tnggcttact tcccgggnn 



60 
120 
180 
240 
259 



<210> 52 
<211> 877 
<212> DNA 
<213> E. Coli 



<220> 

<221> misc_feature 
<222> (1) . . . (877) 
<223> n = A,T,C or G 



<400> 
cagcagagcg 
acgaacgcgg 
catgctgtag 
acgaggtaca 
agttacttta 
tgctttggtg 
cagcagtttt 
ccgagggact 
gtctttgtag 
tgagtaggac 
ggcgggcagg 
cggatggcct 
tgtatccgnt 
taaagtttta 
ccctatntta 



52 

cggccttctt 
tcgactttat 
ttaccacgac 
gcaatagtga 
cagccgatcg 
atcagcggtt 
ttgtcagcga 
tgcatgacag 
taatcatgca 
aaatccgccg 
acgcccgcca 
ttttgcgttt 
catcccatcc 
tggngttaaa 
acgaactngg 



cgtcagattt 
cgtagtcgat 
cgtcgaaaga 
tcaggcgctc 
gatagccctg 
tttgaccgga 
tcgcttcacc 
aattgtagtt 
gtttcgccat 
ggagcggatt 
taaactgcca 
ctacaaactc 
tatcgatgat 
cttgggctgg 
ctantttngg 



cgcagtagtg 
ttctgggaag 
cttagcggac 
aaagaactcc 
acggattttg 
gattgctgcc 
aacacccatg 
aaactcagtc 
cgtactactc 
tgaacgttgc 
ggcatcaaat 
ttttggttat 
aagctgtcaa 
cagnttncca 
tcaatcn 



gtaatggtaa 
atgatctgct 
aggccacgga 
cacatgcgtt 
aagcctgcaa 
aggtctgctg 
ttcagggtga 
atgagttttt 
catgtcggtg 
gaagcaacgg 
taagcagaag 
ttttctaaat 
acatgagaat 
atggcttaat 



tatccaaacc 
cacggacacc 
agtcacggat 
cgccacgcag 
cagatttgcg 
ctgcgttatc 
tcttctcgac 
taactacttc 
aacgctctcc 
cccggagggt 
gccatcctga 
cattcaaata 
ttaatcaatc 
cagtngaggg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
877 
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<210> 53 
<211> 291 
<212> DNA 
<213> E. Coli 

<400> 53 

tgaacagcag agatacggcc agtgcggcca atgttttttg tcctttaaac ataacagagt 60 

cctttaagga tatagaatag gggtatagct acgccagaat atcgtatttg attattgcta 120 

gtttttagtt ttgcttaaaa atattgttag ttttattaaa tgcaaaacta aattattggt 180 

atcatgaatt tgttgtatga tgaataaaat ataggggggt atagatagac gtcattttca 240 

tagggttata aatgcgacta ccatgaagtt tttaattgaa agtattgggt t 291 

<210> 54 
<211> 282 
<212> DNA 
<213> E. Coli 

<400> 54 

ttattaaatg caaaactaaa ttattggtat catgaatttg ttgtatgatg aataaaatat 60 

aggggggtat agatagacgt cattttcata gggttataaa tgcgactacc atgaagtttt 120 

taattgaaag tattgggttg ctgataattt gagctgttct attcttttta aatatctata 180 

taggtctgtt aatggatttt atttttacaa ttttttgtgt ttaggcatat aaaaatcaac 240 

ccgccatatg aacggcgggt taaaatattt acaacttagc aa 282 

<210> 55 
<211> 293 
<212> DNA 
<213> E. Coli 

<220> 

<221> misc__f eature 
<222> (1) . . . (293) 
<223> n = A,T,C or G 

<400> 55 

cggggtccgg cgctcatcaa caatcggggg gcagcaaggg gctgaaacgg gaaagcccct 60 

cccgaagaag gggccttgta taaggaaagg gttatgatga agctcgtcat catactggtt 120 

gtgtngttac tgttaagttt cccgacttac taacaactca tcagaggggg gagaaatcct 180 

cccttaccct tgttccttta ctctaggttg aaaaaacaac agcgtcaata ggcctgccat 240 

gtacgaagcg agatctgtga accgctttcc ggttagcctt ttttatcctg ttg 293 

<210> 56 

<211> 300 

<212> DNA 

<213> E. Coli 



<400> 56 



tctgcgttcc gctaaaaggt gcaaatgctc 
ggggaaggca aaattgcctc tgggaaagca 
ggggggcagc aaggggctga aacgggaaag 
aaagggttat gatgaagctc gtcatcatac 
cttactaaca actcatcaga ggggggagaa 



aggacgttgc agcgttttgc gtgaccgctc 
ttgcgcgggg tccggcgctc atcaacaatc 
cccctcccga agaaggggcc ttgtataagg 
tggttgtgtt gttactgtta agtttcccga 
atcctccctt acccttgttc ctttactcta 



60 
120 
180 
240 
300 



<210> 57 
<211> 359 
<212> DNA 



<213> E. Coli 



-17- 



<400> 57 

caacacagga ggctgggaat gccgcagaaa tatagattac tttctttaat agtgatttgt 60 

ttcacgcttt tatttttcac ctggatgata agagattcac tgtgtgaatt gcatattaaa 120 

caggagagtt atgagctggc ggcgttttta gcctgcaaat tgaaagagta agagtcttcg 180 

gcgggaaatt attcccgcct tacttacggc gttgcgcatt ctcattgcac ccaaatttat 240 

tcttcacaaa aataataata gattttatta cgcgatcgat tatttatttc ctgaaaacaa 300 

ataaaaaaat ccccgccaaa tggcagggat cttagattct gtgcttttaa gcagagatt 359 



<210> 58 
<211> 700 
<212> DNA 
<213> E. Coli 

<220> 

<221> mis cofeature 
<222> (1) . . . (700) 
<223> n - A,T,C or G 



<400> 58 

aaaccttttt ctcctgtttt tcatagaggg caacccatgt cctgacctgg gttcggggga 60 

caccaaaacg tgccgagatg atcctgtaac catcatcagt tgtgaagtag tgattcacga 120 

cttcaaggcg cttttcaaaa gggtattttg gctttgacat attaggggct attccatttc 180 

atcgtccaac aaaatgggtg cagtacatac tcgttggaaa tcaacacagg aggctgggaa 240 

tgccgcagaa atatagatta ctttctttaa tagtgatttg tttcacgctt ttatttttca 300 

cctggatgat aagagattca ctgtgtgaat tgcatattaa acaggagagt tatgagctgg 360 

cggcgttttt agcctgcaaa ttgaaagagt aagagtcttc ggcgggaaat tattcccgcc 420 

ttacttacgg cgttgcgcat tctcattgca cccaaattta ttcttcacaa aaataataat 480 

agattttatt acgcgatcga ttatttattt cctgaaaaca aataanaaaa tccccgccaa 540 

atggcaggga tcttagattc tgtgctttta agcagagatt acaggctggt tacgttacca 600 

gctgccgggc ctttaacgcc gctttcgatg gtgaaggaca ctttctgacc ttcgtccaga 660 

gattgtaacc atcggtctgg atagccnaga aatgtccaac 700 



<210> 59 
<211> 631 
<212> DNA 
<213> E. Coli 

<220> 

<221> inisc_feature 
<222> (1) . . . (631) 
<223> n = A, T,C or G 



<400> 59 

tggtggcatt ggttgctgga gagagaaaac ccccgcacgt tgcaggtatg cacctgacaa 60 

caccacgggg gctaatcttg actctagacc actcaagaat agccgcgaaa cgttgtcatt 120 

acaacacagg cggctatatg acgttcgcag agctgggcat ggccttctgg catgatttag 180 

cggctccggt cattgctggc attcttgcca gtatgatcgt gaactggctg aacaagcgga 240 

agtaacgtgt catgcgggcg tcaggctgcc gtaatggcaa tttgcgcccg gaccaggccg 300 

caggggggaa actctgcggc ctttttcgtt cttactgcgg gtaaggcacc cagtcgccgc 360 

cgttcaggcg aacgtacggt ttatcctggt attgaataac tactgcattt gagttctcgg 420 

agaccggtgc tgtttgtggc aacccactgg tgagtttttt ccagtcaaca ttgtcttcgg 480 

tgaaaatctt gccatcgaga acgcgaacca ccagatcgga gatagccagg aagctgctcg 54 0 

gttgttcgat gacaatcggt gccccctgat gcggtgcctt catgccgaag aatttcaccc 600 

caacggggac gtcngtgata gaccgggcta g 631 



<210> 60 
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<211> 648 
<212> DNA 
<213> E. Coli 

<220> 

<221> inisc__f eature 
<222> (1) . . . (648) 
<223> n = A,T,C or G 



<400> 60 

ggctcaggcn tgctgattgt ttttttgtgc aatggcccng tattagcgtc gttgctgtcg 60 

atggagagaa tcataaacgt ggtgaatgat gattgttagc aaggaaaact gtcaaaaatc 120 

ttcaaaaaat ttgagggata aggccggaat ggctccggcc agagggaagt taaccgcgaa 180 

gctgttgctg cttgagggtc gttttaacca gacgccaggc gctccatacg ccaaaaccgc 240 

gtctggccca gcggaccagc atattaggat ggcgaatcgt ccagatcgcc atcacgctac 300 

tgccaaccag cgcccaggag cgcagactta gcagcatatt ccancgacga tcgtaagcgc 360 

ctgttgtctc cagccattca cgacgactgg cggaagggnc cgcgnctgac caacttgnct 420 

tttagnctga tncanattan attnataaac gcagnanncn ggtntgatta atcntatttn 480 

gctctngtct ggtagttagc nncggnnngt ctcnttntna cccnnttcnn tttannttac 54 0 

natnngtaan ttatntttnt nngtctnant tntanttgng tactntaagt ntatncgnnn 600 

atnntnnnan nnnncagnnc ntntttttta aatnntttnt nanncnnc 648 



<210> 61 
<211> 737 
<212> DNA 
<213> E. Coli 

<220> 

<221> misc_f eature 
<222> (1) . . . (737) 
<223> n = A, T,C or G 



<400> 61 

tgctaatatc tttctcattg agatgaaaat taaggtaagc gaggaaacac accacaccat 60 

aaacggaggc aaataatgct gggtaatatg aatgttttta tggccgtact gggaataatt 120 

ttattttctg gttttctggc cgcgtatttc agccacaaat gggatgacta atgaacggag 180 

ataatccctc acctaaccgg ccccttgtta cagttgtgta caaggggcct gatttttatg 240 

acggcgaaaa aaaaccgcca gtaaaccggc ggtgaatgct tgcatggata gatttgtgtt 300 

ttgcttttac gctaacaggc attttcctgc actgataacg aatcgttgac acagtagcat 360 

cagttttctc aatgaatgtt aaacggagct taaactcggt taatcacatt ttgttcgtca 420 

ataaacatgc agcgatttct tccggtttgc ttaccctcat acattgcccg gtccgctctt 480 

ccaatgacca catccagagg ctcttcagga aatgcgcgac tcacacctgc tgtcacggta 54 0 

atgttgatat gcccttcaga atgtgtgatg gcatggttat cgactaactg gcaaattctg 600 

acacctgcac gacatgcttc ttcatcatta gccgctttga caataatgat aaattcttcg 660 

cccccgtagc gataaaccgt ttcgtaatna cgcgtccaac tgggntaagt aaagttgcca 720 

gggtgccgta atcttac 737 



<210> 62 
<211> 648 
<212> DNA 
<213> E. Coli 

<220> 

<221> misc_feature 
<222> (1) . . . (648) 
<223> n = A,T,C or G 
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<400> 62 

tgcttttgaa tatgtgctcg caatcttgag aaggaaatgg cgaccacgaa agaaaaggca 60 

aaaaccgata atctgaaaga acccaagtat ttcagtataa gcattgaatg ccgaccagta 120 

aactctttcg gattcaccca gaaagtgaan ccaaaatgat aatcgtatac ataagtcttt 180 

cgagtggctc gttagcaaaa agtttcaaca atggagtaaa tacatccaac atatcaataa 240 

ctctcaactg taaggggatt gaaatggtaa ccccagctct tcgcttgagg ggtatagccg 300 

agaccaccga agccccggag gtggtgaaat aaaaccgggc acaacacgaa agggcgcatt 360 

tccgatatcc ataaaagaag tcgggtcttt gtctggtaaa attaaattgg tgggaagtgc 420 

gcctccgggt tgtaaatacc gactttgctg ggtgtagcct ggcggcatca agtttttttc 480 

tggaagttcg ctgatgtccg ccctttttaa agggaatttt ggtgatgccg gtgaatgccg 540 

cttaaccccc cgtgggccca gttaaaagtc atggtaagnc ctaatnggtt tggggtggga 600 

aaagccnact gnnaattggt tacctggttt gcaagtancc ctggaagg 64 8 



<210> 63 
<211> 237 
<212> DNA 
<213> E. Coli 

<220> 

<221> misc_f eature 
<222> (1) . . . (237) 
<223> n = A,T,C or G 



<400> 63 

ggtgtttant tacaagagat tcatctttgt ntaaancccn gataagtaat tacgcataaa 60 

acaacaatga ttataatagc aaaaataaat attatcatct ttgatagatt acttgagata 120 

gccagcatct tgtaaagcct ttatcgtttt tttatgctct ggattaatat aatcactaca 180 

tctatctgag caatctgttg ttgatggaca tgtcaaccca tggtcattta cagccaa 237 



<210> 64 
<211> 427 
<212> DNA 
<213> E. Coli 



<400> 64 

gataattaga gtttgtcgtc agaaaattga cgttacccat aacaaatgaa aggccaggta 60 

aatcatgcca ttagtcattg ttgctatcgg tgtaatcttg ttgttgctcc tgatgatccg 120 

cttcaaaatg aacggcttca tcgctctcgt cctcgtggcg cttgctgttg gattaatgca 180 

aggaatgccg ctggataaag ttattggctc catcaaagcc ggtgtcggcg ggacgctcgg 240 

tagccttgcc ctgatcatgg gttttggcgc aatgctgggc aaaatgctgg cagactgcgg 300 

tggcgcacaa cgtatcgcca ccacgctgat tgccaaattt ggtaaaaaac acatccagtg 360 

ggcggtggta ctgaccggtt ttaccgttgg ttttgccctg ttctatgaag tgggctttgt 420 

gctgatg 427 



<210> 65 
<211> 261 
<212> DNA 
<213> E. Coli 

<220> 

<221> misc_feature 
<222> (1) . . . (261) 
<223> n = A,T,C or G 

<400> 65 

caaagaacct tcaacatgaa aaatatccat ttgtttgcaa aaaaagatta ttaggaagga 60 
aattaatgca attatcgaaa attcaaaaaa tatccaaaaa tngtatactt tattccagaa 120 
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gagttcaata taatgtttgt cttcaatttt tcttacttca gggtaatata gattgctcat 180 
tacattgtga gcttcatctt tatttaattt tctgttgact ccagctctcc gtgataacgg 240 
ttttataatt agatgcttat c 261 



<210> 66 
<211> 98 
<212> DNA 
<213> E. Coll 

<400> 66 

agatgattgc cgggaacttg ttagcggcac gcaggcggcg gctcgcaccc ttaccctgct 60 
ctttacgtac ttctgcgttg atagtaaaca tttctttc 98 

<210> 67 
<211> 260 
<212> DNA 
<213> E. Coli 



<210> 68 

<211> 95 

<212> DNA 

<213> E. Coli 

<400> 68 

aaaaacggcg taaagaaagg ttgcaaacat gttaataaaa actcaaattg atcccacgta 60 
tatattacgc cgcaaaatcc ttacaataaa caggg 95 

<210> 69 
<211> 174 
<212> DNA 
<213> E. Coli 

<400> 69 

ttaattatta aaatagtgta acgcgattat gtggttatgg gggtaaacat taaataaacc 60 

agcggggagg ggaggtaaag tgaaaaaata aaaagcggat aatcttaata agcaggccgg 120 

acagcatcgc catccggcac tgatacgagg tttatttcag ctcatcaacc atcg 174 

<210> 70 
<211> 138 
<212> DNA 
<213> E. Coli 

<400> 70 

agtctgtaaa aacgtcaaaa agagtgtttt atcaacagaa gaatggaggt ctgacagata 60 

gtagtaatgc aaaaaaatgg agacttaagt tgaatgaacg ggagtaaagc gaaaagacta 120 

tagagtgaag gagaaatt 138 



<400> 67 



aagcgcgaac gaagtcgatg tgctgcagct tcggtttgta cgggtgacgc tgtacgtcct 
gagctttaac tttgatttct ttaccgtcaa caacgatggt cagaacttcg ctgtagaatt 
cagctttagc ttgcatgttc atgactttgt cgtgatccag ctcgatagcc agcggcgctt 
ctttgccacc gtagatgatt gccgggaact tgttagcggc acgcaggcgg cggctcgcac 
ccttaccctg ctctttacgt 



60 
120 
180 
240 
260 



<210> 71 
<211> 191 
<212> DNA 
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<213> Coli 



<400> 71 

tttgttggct taatattcta ttgttatctt tatttataga tgtttatatt gcatgaggtg 60 

gtttttggag agaagaatga ggaagatgcg tcgagccaca gaaacgttag ctttacatat 120 

agcggaggtg atgtgaattt aatttacaat agaaataatt tacatatcaa acagttagat 180 

gctttttgtc g 191 



<210> 72 
<211> 244 
<212> DNA 
<213> E. Coli 



<400> 72 

ggccatttat acaggaaaag cctatgtcag aacgtaaaaa ctcaaaatca cgccgtaatt 60 

atctcgttaa atgttcctgc ccaaactgca cccaagagtc agaacacagt ttttcaagag 120 

tacaaaaagg tgcccttttg atctgccctc attgcaacaa agtattccag acaaatctta 180 

aagctgtagc ctgattgatt ttattagtaa caagtatttt ttatatttta ataatatatt 240 

taaa 244 



<210> 73 
<211> 327 
<212> DNA 
<213> E. Coli 



<220> 

<221> misc_feature 
<222> (1) . . . (327) 
<223> n = A,T,C or G 



<400> 73 

aaattttcag gtaccttgtc accatacttt tttttctgag cattaatgat attttgagct 60 

tcttgaggat ctttaactcc ccacatttgg tggaaagtat tcatattaaa aggaaggntg 120 

aataatttgn ctttataaat cgccagtgga gaattagtaa aacgattaaa ttctactaaa 180 

tnattaaccg naaaaaaatt cccatatata tttatcattg gtatgaaaaa tatgtgcacc 240 

atatttatga atntggatac cctnacagtc ctctgtgtac gcatttccac cgatatgatt 300 

tcttttctna atcactaaaa ctttttt 327 



<210> 74 
<211> 150 
<212> DNA 
<213> E. Coli 



<400> 74 

gcagtgatcg aagcgatgac gaagtgtatg gaaaaatcag aaaaactcag caaatcctga 60 

tgactttcgc cggacgtcag gccgccactt cggtgcggtt acgtccggct ttctttgctt 120 

tgtaaagcgc caaatctgcc gatttcaacc 150 



<210> 75 
<211> 330 
<212> DNA 
<213> E. Coli 



<400> 75 

gaaagtatct tcgttattga catcactgga aaatataact tgcttttcat tattaaactc 60 

gaagcgcgta ccgtatctgg acaaacattt atcgagctta ccaaattcct gaagaggttt 120 

aactacagat aacatttgcg cgtcctttgc agtaatgccc gtcaaatcct tgacgggcat 180 
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tatttagatt aaattaccag tatttcttcg gagtgaagaa tattaccagg tatatttaac 240 
acccacgttc gcggaccagt cttgatctac gtcaccacca ccgaggtagt tagcatcggt 300 
ataggcgctg aagttcttgg tgaagctaaa 330 



<210> 76 

<211> 194 

<212> DNA 

<213> E. Coli 

<400> 76 

tgtttttttc cagcaacgga gcaaaaggtt tgcccttgtg cagctcaggg ttaaccactt 60 

taactacgtg gcgacgaccc ggagatgtcg gtttacattt aacaactgcc attgtattac 120 

tcctccgact tactcagcgc cgccaacgaa gtccagattc tggccttctt tcagggtgac 180 

gtaagctttt ttcc 194 

<210> 77 
<211> 188 
<212> DNA 
<213> E, Coli 

<400> 77 

tccctttaac taccagggtg ttaacgactt cgacttcgac ttcaaacagt ttctgcacag 60 

cagctttgat ttctgctttg gtcgcgtctt tagcaacttt gagtactatg gtgttggatt 120 

tttccatcgc agtagacgct ttttcagaaa cgtgcggtgc acgcagcacc ttcagcagac 180 

gttcttca 188 

<210> 78 
<211> 173 
<212> DNA 
<213> E. Coli 



<400> 78 

acaaaggcga acaaagcctg tgaagcccga aggctccaca gacagtgcta cttgaaggcc 60 

ttactgtttc ttcttaggag cgagcaccat gatcatctgg cggccttcga tcttggttgg 120 

gaaggattcg accactgcca gttcttgcaa atcgtctttc acgcgattaa gca 173 



<210> 79 
<211> 272 
<212> DNA 
<213> E. Coli 

<220> 

<221> misc_f eature 
<222> (1) . . . (272) 
<223> n = A, T, C or G 



<400> 79 

tggagaaaac gggtgattga taaagcaatc atcgttctag gggcgttaat tgcgctgctg 60 

gaactgatcc cgctttctgc ttcaagcttc tgaactggat acggaaacgt aatnagggct 120 

aaagaagaca ctactcttag ccctttaaca tttaacgcat tgtcacgaac tcttctgccg 180 

ccgttgggtg aatggcgacg ggtattggtc gaaatctttt ttgggtggcc ccatctttaa 240 

cgcccacccg cgaaaccctg caacatttcg tc 272 



<210> 80 
<211> 259 
<212> DNA 
<213> E. Coli 
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<400> 80 



cgcaggcagc tgatggtcaa caggatgaga gaaacccaga gacaggttaa tcacattgcc 
tttaaccgct gcacggtaac ctacaccaac cagctgcagc ttcttagtga agccttcggt 
aacaccgata accattgagt tcagcagggc acgcgcggta ccagcctgtg cccaaccgtc 
tgcgtaacca tcacgcggac cgaaggtcag ggtattatct gcatgtttaa cttcaacagc 
atcgttgaga gtacgagtc 



60 
120 
180 
240 
259 



<210> 81 

<211> 73 

<212> DNA 

<213> E. Coli 

<400> 81 

caggtcggaa cttacccgac aaggaatttc gctaccttag gaccgttata gttacggccg 60 
ccgtttaccg ggg 73 

<210> 82 
<211> 666 
<212> DNA 
<213> E. Coli 

<400> 82 

atgaacgttt tctcgcaaac tcaacgctat aaggcgttgt tctggttatc gttatttcat 60 

ctgctggtga tcacctccag taactatctg gttcagcttc ccgtctccat tttgggtttc 120 

cataccacct ggggcgcgtt tagctttccg tttatttttc ttgctaccga cctgaccgtg 180 

cgtatttttg gcgcaccgct ggcccgacgc attatcttcg cggtaatgat ccctgcgtta 240 

ttaatctcct acgtcatctc gtcgctattc tatatgggtt cctggcaggg attcggcgca 300 

ctcgcccact tcaacctgtt tgtcgcccgt atcgccaccg ccagtttcat ggcctacgcg 360 

ctggggcaaa tcctcgacgt gcacgttttt aaccgcctgc gtcagagtcg ccgctggtgg 420 

ctggcaccga cagcgtccac actgttcggt aacgtcagcg acacgctggc ctttttcttc 480 

attgccttct ggcgtagccc ggatgccttt atggctgaac actggatgga aatcgcgctg 540 

gtcgattact gtttcaaagt gttaatcagt atcgttttct tcctgccaat gtatggcgta 600 

ttactcaata tgctgttgaa aagactggca gataaatccg aaatcaacgc tttgcaggcg 660 

agttaa 666 

<210> 83 

<211> 612 

<212> DNA 

<213> E. Coli 

<400> 83 

gtgataagat ggatgaatga gccgttatgg ccgtttatcg aaaggaagaa gtcaatgcgc 60 

aatctggtta aatatgtcgg aattggcctg ctggttatgg ggcttgcggc ctgtgatgat 120 

aaagacacta acgctacggc gcagggttcg gtcgcggaaa gtaacgctac cgggaatccc 180 

gtcaacctgc ttgatggcaa gttaagtttc tcgctgccag cggatatgac cgaccagagc 240 

ggtaagctgg gaacgcaggc caataacatg catgtctggt ccgacgccac cgggcagaaa 300 

gcagtcatcg tcatcatggg cgatgatccg aaagaagatc tggcggtgct ggcgaagcgt 360 

ctggaagatc agcaacgtag ccgcgatccg cagctgcaag tggtaaccaa taaagccatt 420 

gagctgaaag gtcacaaaat gcagcagtta gacagtatta tctccgcgaa aggccagacg 480 

gcgtactctt ccgttattct gggtaacgtg ggtaatcaac tgctgaccat gcaaattacg 540 

ctgcccgctg acgatcagca aaaagcgcag accaccgcag aaaacatcat taatacgctg 600 

gttattcagt aa 612 



<210> 84 
<211> 975 
<212> DNA 
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<213> E, Coli 



<400> 84 



atggcgaata 
tgcgtggata 
gctgccgggg 
ctggaaaccg 
atttatgaac 
tttattctgg 
atcgaaaccg 
aagcttgatt 
gtctcaaaag 
ctgccggtca 
aatggtgggg 
ggaattcgtc 
gtgccgattg 
acctggattg 
gacagccgtt 
tggatgagct 
ggcggcatcc 



tgtttgccct gattctggtg 
aattcttttt cgcacctaaa 
actcactgga taaagcaacg 
gtgcttctgt ttttccggta 
cgttccagat cccgtcaggt 
tagagaagtt tgcttatggc 
gtcatccgaa acgcggcgat 
acatcaagcg cgcggtgggt 
agctgacgat tcaaccggga 
cctactcaaa cgtggaaccg 
aagcgaccag cggattcttt 
tttccgagcg taaagagaca 
cgcaggatca ggtggggatg 
ttcctccggg acaatacttc 
actggggctt tgtgccggaa 
tcgataagca agaaggcgaa 
attaa 



attgccacac 
cggcgggaac 
ttgaaaaagg 
ctggctatcg 
tcgatgatgc 
attaaagatc 
atcgtggtct 
ttaccgggcg 
tgcagttccg 
agcgatttcg 
gaagtgccga 
ctgggtgatg 
tattaccagc 
atgatgggcg 
gcgaatctgg 
tggccgactg 



tggtgacggg 
gtcaggcagc 
ttgcgccgaa 
tattgattgt 
cgactctgtt 
ctatctacca 
ttaaatatcc 
ataaagtcac 
gccaggcgtg 
ttcagacctt 
aaaacgaaac 
tgacgcaccg 
agccagggca 
acaaccgcga 
tcggtcgggc 
gtctgcgctt 



cattttatgg 
ggcgcaggcg 
gcctggctgg 
gcgttcgttt 
aattggtgat 
gaaaacgctg 
ggaagatcca 
ttacgatccg 
tgaaaacgcg 
ctcacgccgt 
caaagaaaat 
cattctgaca 
acaactggca 
caacagcgcg 
aacggctatc 
aagtcgcatt 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
975 



<210> 85 
<211> 1761 
<212> DNA 
<213> E. Coli 

<400> 85 

ttgaccatta cgaaacttgc atggcgtgac ctggttcctg ataccgatag ctatcaggaa 60 

atatttgctc agccacattt gattgacgaa aacgatcctt tattcagtga tactcaaccg 120 

cggctgcaat ttgcgctgga gcagttgctg catacgcgag catcctcctc ttttatgctg 180 

gcgaaggccc cggaagagtc tgagtatctg aatcttattg ccaatgccgc gcgtacgcta 240 

caaagcgatg caggccaact ggtgggcggt cactatgagg tttccggcca ctccatccgc 300 

ttacgtcacg cagtgagtgc agatgataat tttgcgactt taacgcaagt tgtcgctgcc 360 

gactgggtag aagcggagca actctttggc tgcctgcgcc agtttaatgg cgacattacc 420 

ctgcagcctg gtctggtgca tcaggcaaat ggcggtattc tcattatctc tttgcgtaca 480 

ctgctggcgc aacctctgct gtggatgcgg ctgaaaaata tcgttaaccg cgagcgtttt 540 

gactgggttg cgtttgatga gtcgcgccct ctccccgtct ctgtgccttc gatgccattg 600 

aagctgaaag tcattctggt aggcgaacgc gaatcattgg ctgatttcca ggagatggag 660 

ccagagcttt cagagcaggc tatttatagc gaatttgaag atactctgca gattgtcgat 720 

gcggagtcag taacccagtg gtgtcgctgg gtgacattta ccgccagaca taatcactta 780 

cctgcaccgg gagcggatgc ctggccgata cttatccgcg aagcagcacg ctacaccggt 840 

gaacaagaaa cacttccgct tagcccgcag tggatcctcc gccagtgtaa agaggtcgcc 900 

tccctgtgtg atggcgacac cttctccggc gagcagctaa acttaatgct gcagcagcgt 960 

gaatggcgcg aaggtttcct cgctgaacgt atgcaggatg agatccttca ggagcaaatc 1020 

ctgattgaaa ccgaaggcga acgcatcggg caaattaacg ccctttcggt cattgaattt 1080 

ccgggtcatc cacgcgcttt tggcgaacct tctcgcatta gctgcgttgt gcatattggc 1140 

gatggtgaat tcaccgacat cgaacgcaaa gcggagcttg gcggcaatat ccatgcgaaa 1200 

gggatgatga tcatgcaagc gttcctgatg tcggaactac agcttgagca acagatcccc 1260 

ttctcagcat cgctgacatt tgagcagtca tacagtgaag ttgatggaga tagtgcctcg 1320 

atggctgaac tctgcgccct gataagcgcc ctcgccgatg tgccggtgaa tcagagtatc 1380 

gctatcacag gttcagtcga tcagttcggt cgcgcccagc cggtcggtgg tttaaatgag 1440 

aaaatcgaag gcttctttgc tatttgccag caacgtgagt taaccgggaa acaaggtgtc 1500 

attatcccca cagctaacgt tcgccattta agtcttcaca gtgaactggt gaaagcggta 1560 

gaagaaggca aattcaccat ctgggcagta gacgatgtga ctgacgcact gccgttatta 1620 

ttaaatctgg tgtgggatgg cgaaggccaa acgacgctga tgcaaaccat ccaggaacgt 1680 

atcgcgcaag catcgcaaca ggaaggacgt caccgttttc catggccatt acgttggctg 1740 

aactggttta ttccgaactg a 17 61 
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<210> 86 
<211> 1185 
<212> DNA 
<213> E. Coli 



<400> 86 

gtgtctaaag aaaaatttga acgtacaaaa ccgcacgtta acgttggtac tatcggccac 60 

gttgaccacg gtaaaactac tctgaccgct gcaatcacca ccgtactggc taaaacctac 120 

ggcggtgctg ctcgtgcatt cgaccagatc gataacgcgc cggaagaaaa agctcgtggt 180 

atcaccatca acacttctca cgttgaatac gacaccccga cccgtcacta cgcacacgta 240 

gactgcccgg ggcacgccga ctatgttaaa aacatgatca ccggtgctgc tcagatggac 300 

ggcgcgatcc tggtagttgc tgcgactgac ggcccgatgc cgcagactcg tgagcacatc 360 

ctgctgggtc gtcaggtagg cgttccgtac atcatcgtgt tcctgaacaa atgcgacatg 420 

gttgatgacg aagagctgct ggaactggtt gaaatggaag ttcgtgaact tctgtctcag 480 

tacgacttcc cgggcgacga cactccgatc gttcgtggtt ctgctctgaa agcgctggaa 54 0 

ggcgacgcag agtgggaagc gaaaatcctg gaactggctg gcttcctgga ttcttatatt 600 

ccggaaccag agcgtgcgat tgacaagccg ttcctgctgc cgatcgaaga cgtattctcc 660 

atctccggtc gtggtaccgt tgttaccggt cgtgtagaac gcggtatcat caaagttggt 720 

gaagaagttg aaatcgttgg tatcaaagag actcagaagt ctacctgtac tggcgttgaa 780 

atgttccgca aactgctgga cgaaggccgt gctggtgaga acgtaggtgt tctgctgcgt 840 

ggtatcaaac gtgaagaaat cgaacgtggt caggtactgg ctaagccggg caccatcaag 900 

ccgcacacca agttcgaatc tgaagtgtac attctgtcca aagatgaagg cggccgtcat 960 

actccgttct tcaaaggcta ccgtccgcag ttctacttcc gtactactga cgtgactggt 1020 

accatcgaac tgccggaagg cgtagagatg gtaatgccgg gcgacaacat caaaatggtt 1080 

gttaccctga tccacccgat cgcgatggac gacggtctgc gtttcgcaat ccgtgaaggc 1140 

ggccgtaccg ttggcgcggg cgttgttgct aaagttctgg gctaa 1185 

<210> 87 
<211> 2115 
<212> DNA 
<213> E. Coli 



<400> 87 

atggctcgta caacacccat cgcacgctac cgtaacatcg gtatcagtgc gcacatcgac 60 

gccggtaaaa ccactactac cgaacgtatt ctgttctaca ccggtgtaaa ccataaaatc 120 

ggtgaagttc atgacggcgc tgcaaccatg gactggatgg agcaggagca ggaacgtggt 180 

attaccatca cttccgctgc gactactgca ttctggtctg gtatggctaa gcagtatgag 240 

ccgcatcgca tcaacatcat cgacaccccg gggcacgttg acttcacaat cgaagtagaa 300 

cgttccatgc gtgttctcga tggtgcggta atggtttact gcgcagttgg tggtgttcag 360 

ccgcagtctg aaaccgtatg gcgtcaggca aacaaatata aagttccgcg cattgcgttc 420 

gttaacaaaa tggaccgcat gggtgcgaac ttcctgaaag ttgttaacca gatcaaaacc 480 

cgtctgggcg cgaacccggt tccgctgcag ctggcgattg gtgctgaaga acatttcacc 540 

ggtgttgttg acctggtgaa aatgaaagct atcaactgga acgacgctga ccagggcgta 600 

accttcgaat acgaagatat cccggcagac atggttgaac tggctaacga atggcaccag 660 

aacctgatcg aatccgcagc tgaagcttct gaagagctga tggaaaaata cctgggtggt 720 

gaagaactga ctgaagcaga aatcaaaggt gctctgcgtc agcgcgttct gaacaacgaa 780 

atcatcctgg taacctgtgg ttctgcgttc aagaacaaag gtgttcaggc gatgctggat 840 

gcggtaattg attacctgcc atccccggtt gacgtacctg cgatcaacgg tatcctggac 900 

gacggtaaag acactccggc tgaacgtcac gcaagtgatg acgagccgtt ctctgcactg 960 

gcgttcaaaa tcgctaccga cccgtttgtt ggtaacctga ccttcttccg tgtttactcc 1020 

ggtgtggtta actctggtga taccgtactg aactccgtga aagctgcacg tgagcgtttc 1080 

ggtcgtatcg ttcagatgca cgctaacaaa cgtgaagaga tcaaagaagt tcgcgcgggc 1140 

gacatcgctg ctgctatcgg tctgaaagac gtaaccactg gtgacaccct gtgtgacccg 1200 

gatgcgccga tcattctgga acgtatggaa ttccctgagc cggtaatctc catcgcagtt 1260 

gaaccgaaaa ccaaagctga ccaggaaaaa atgggtctgg ctctgggccg tctggctaaa 1320 

gaagacccgt ctttccgtgt atggactgac gaagaatcta accagaccat catcgcgggt 1380 
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atgggcgaac tgcacctcga catcatcgtt gaccgtatga agcgtgaatt caacgttgaa 1440 

gcgaacgtag gtaaaccgca ggttgcttac cgtgaaacta tccgccagaa agttaccgat 1500 

gttgaaggta aacacgcgaa acagtctggt ggtcgtggtc agtatggtca tgttgttatc 1560 

gacatgtacc cgctggagcc gggttcaaac ccgaaaggct acgagttcat caacgacatt 1620 

aaaggtggtg taatccctgg cgaatacatc ccggccgttg ataaaggtat ccaggaacag 1680 

ctgaaagcag gtccgctggc aggctacccg gtagtagaca tgggtattcg tctgcacttc 1740 

ggttcttacc atgacgttga ctcctctgaa ctggcgttta aactggctgc ttctatcgcc 1800 

tttaaagaag gctttaagaa agcgaaacca gttctgcttg agccgatcat gaaggttgaa 18 60 

gtagaaactc cggaagagaa caccggtgac gttatcggtg acttgagccg tcgtcgtggt 1920 

atgctcaaag gtcaggaatc tgaagttact ggcgttaaga tccacgctga agtaccgctg 1980 

tctgaaatgt tcggatacgc aactcagctg cgttctctga ccaaaggtcg tgcatcatac 2040 

actatggaat tcctgaagta tgatgaagcg ccgagtaacg ttgctcaggc cgtaattgaa 2100 

gcccgtggta aataa 2115 

<210> 88 
<211> 540 
<212> DNA 
<213> E. Coli 

<400> 88 

atgccacgtc gtcgcgtcat tggtcagcgt aaaattctgc cggatccgaa gttcggatca 60 

gaactgctgg ctaaatttgt aaatatcctg atggtagatg gtaaaaaatc tactgctgaa 120 

tctatcgtat acagcgcgct ggagaccctg gctcagcgct ctggtaaatc tgaactggaa 180 

gcattcgaag tagctctcga aaacgtgcgc ccgactgtag aagttaagtc tcgccgcgtt 240 

ggtggttcta cttatcaggt accagttgaa gtccgtccgg ttcgtcgtaa tgctctggca 300 

atgcgttgga tcgttgaagc tgctcgtaaa cgcggtgata aatccatggc tctgcgcctg 360 

gcgaacgaac tttctgatgc tgcagaaaac aaaggtactg cagttaagaa acgtgaagac 420 

gttcaccgta tggccgaagc caacaaggcg ttcgcacact accgttggtt atcccttcgg 4 80 

agttttagtc accaggcggg cgcttccagt aagcagcccg ctttgggcta cttaaattga 540 

<210> 89 
<211> 1549 
<212> DNA 
<213> E. Coli 

<400> 89 

aaattgaaga gtttgatcat ggctcagatt gaacgctggc ggcaggccta acacatgcaa 60 

gtcgaacggt aacaggaagc agcttgctgc ttcgctgacg agtggcggac gggtgagtaa 120 

tgtctgggaa gctgcctgat ggagggggat aactactgga aacggtagct aataccgcat 180 

aatgtcgcaa gaccaaagag ggggaccttc gggcctcttg ccatcggatg tgcccagatg 240 

ggattagctt gttggtgggg taacggctca ccaaggcgac gatccctagc tggtctgaga 300 

ggatgaccag ccacactgga actgagacac ggtccagact cctacgggag gcagcagtgg 360 

ggaatattgc acaatgggcg caagcctgat gcagccatgc cgcgtgtatg aagaaggcct 420 

tcgggttgta aagtactttc agcggggagg aagggagtaa agttaatacc tttgctcatt 480 

gacgttaccc gcagaagaag caccggctaa ctccgtgcca gcagccgcgg taatacggag 540 

ggtgcaagcg ttaatcggaa ttactgggcg taaagcgcac gcaggcgggt tggttaagtc 600 

agatgtgaaa tccccgggct caacctggga actgcatctg atactggcaa gcttgagtct 660 

cgtagagggg ggtagaattc caggtgtagc ggtgaaatgc gtagagatct ggaggaatac 720 

cggtggcgaa ggcggccccc tggacgaaga ctgacgctca ggtgcgaaag cgtggggagc 780 

aaacaggatt agataccctg gtagtccacg ccgtaaacga tgtcgacttg gaggttgtgc 840 

ccttgaggcg tggcttccgg agctaacgcg ttaagtcgac cgcctgggga gtacggccgc 900 

aaggttaaaa ctcaaatgaa ttgacggggg cccgcacaag cggtggagca tgtggtttaa 960 

ttcgatgcaa cgcgaagaac cttacctggt cttgacatcc acggaagttt tcagagatga 1020 

gaatgtgcct tcgggaaccg tgagacaggt gctgcatggc tgtcgtcagc tcgtgttgtg 1080 

aaatgttggg ttaagtcccg caacgagcgc aacccttatc ctttgttgcc agcggtccgg 1140 

ccgggaactc aaaggagact gccagtgata aactggagga aggtggggat gacgtcaagt 1200 

catcatggcc cttacgacca gggctacaca cgtgctacaa tggcgcatac aaagagaagc 1260 
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gacctcgcga gagcaagcgg acctcataaa gtgcgtcgta gtccggattg gagtctgcaa 1320 

ctcgactcca tgaagtcgga atcgctagta atcgtggatc agaatgccac ggtgaatacg 1380 

ttcccgggcc ttgtacacac cgcccgtcac accatgggag tgggttgcaa aagaagtagg 1440 

tagcttaacc ttcgggaggg cgcttaccac tttgtgattc atgactgggg tgaagtcgta 1500 

acaaggtaac cgtaggggaa cctgcggttg gatcacctcc ttaccttaa 154 9 

<210> 90 
<211> 375 
<212> DNA 
<213> E. Coli 



<400 
atggcaacag 
gtgcctgcgc 
actcctaaaa 
ttcgaagtga 
ctgatccgtg 
gcgcttgact 
cgtcctaagg 



> 90 
ttaaccagct 
tggaagcatg 
aaccgaactc 
cttcctacat 
gcggtcgtgt 
gctccggcgt 
cttaa 



ggtacgcaaa 
cccgcaaaaa 
cgcgctgcgt 
cggtggtgaa 
taaagacctc 
taaagaccgt 



ccacgtgctc 
cgtggcgtat 
aaagtatgcc 
ggtcacaacc 
ccgggtgttc 
aagcaggctc 



gcaaagttgc 
gtactcgtgt 
gtgttcgtct 
tgcaggagca 
gttaccacac 
gttccaagta 



gaaaagcaac 
atatactacc 
gactaacggt 
ctccgtgatc 
cgtacgtggt 
tggcgtgaag 



60 
120 
180 
240 
300 
360 
375 



<210> 91 
<211> 366 
<212> DNA 
<213> E. Coli 



<400> 
atgtctatca 
gtagaactga 
gtagctgctg 
gctgctggcg 
ctgaaagaag 
aaagacgacg 
aaataa 



91 

ctaaagatca 
tctctgcaat 
gcccggttga 
ctaacaaagt 
ctaaagacct 
cagaagcact 



aatcattgaa 
ggaagaaaaa 
agctgctgaa 
tgctgttatc 
ggtagaatct 
gaaaaaagct 



gcagttgcag 
ttcggtgttt 
gaaaaaactg 
aaagcagtac 
gcaccggctg 
ctggaagaag 



ctatgtctgt 
ccgctgctgc 
aattcgacgt 
gtggcgcaac 
ctctgaaaga 
ctggcgctga 



aatggacgtt 
tgctgtagct 
aattctgaaa 
tggcctgggt 
aggcgtgagc 
agttgaagtt 



60 
120 
180 
240 
300 
360 
366 



<210> 92 
<211> 498 
<212> DNA 
<213> E. Coli 



<400> 92 

atggctttaa atcttcaaga caaacaagcg attgttgctg aagtcagcga 
ggcgcgctgt ctgcagtagt tgcggattcc cgtggcgtaa ctgtagataa 
ctgcgtaaag caggtcgcga agctggcgta tacatgcgtg ttgttcgtaa 
cgccgtgctg ttgaaggtac tccgttcgag tgcctgaaag acgcgtttgt 
ctgattgcat actctatgga acacccgggc gctgctgctc gtctgttcaa 
aaagcgaatg caaaatttga ggtcaaagcc gctgcctttg aaggtgagct 
tctcagatcg accgcctggc aactctgccg acctacgaag aagcaattgc 
gcaaccatga aagaagcttc ggctggcaaa ctggttcgta ctctggctgc 
gcgaaagaag ctgcttaa 



agtagccaaa 
aatgactgaa 
caccctgctg 
tggtccgacc 
agagttcgcg 
gatcccggcg 
acgcctgatg 
tgtacgcgat 



60 
120 
180 
240 
300 
360 
420 
480 
498 



<210> 93 
<211> 2145 
<212> DNA 
<213> E. Coli 



<400> 93 

gtgtcccgta ttattatgct gatccctacc ggaaccagcg tcggtctgac cagcgtcagc 



60 
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cttggcgtga tccgtgcaat ggaacgcaaa ggcgttcgtc tgagcgtttt caaacctatc 120 

gctcagccgc gtaccggtgg cgatgcgccc gatcagacta cgactatcgt gcgtgcgaac 180 

tcttccacca cgacggccgc tgaaccgctg aaaatgagct acgttgaagg tctgctttcc 240 

agcaatcaga aagatgtgct gatggaagag atcgtcgcaa actaccacgc taacaccaaa 300 

gacgctgaag tcgttctggt tgaaggtctg gtcccgacac gtaagcacca gtttgcccag 360 

tctctgaact acgaaatcgc taaaacgctg aatgcggaaa tcgtcttcgt tatgtctcag 420 

ggcactgaca ccccggaaca gctgaaagag cgtatcgaac tgacccgcaa cagcttcggc 480 

ggtgccaaaa acaccaacat caccggcgtt atcgttaaca aactgaacgc accggttgat 540 

gaacagggtc gtactcgccc ggatctgtcc gagattttcg acgactcttc caaagctaaa 600 

gtaaacaatg ttgatccggc gaagctgcaa gaatccagcc cgctgccggt tctcggcgct 660 

gtgccgtgga gctttgacct gatcgcgact cgtgcgatcg atatggctcg ccacctgaat 720 

gcgaccatca tcaacgaagg cgacatcaat actcgccgcg ttaaatccgt cactttctgc 780 

gcacgcagca ttccgcacat gctggagcac ttccgtgccg gttctctgct ggtgacttcc 840 

gcagaccgtc ctgacgtgct ggtggccgct tgcctggcag ccatgaacgg cgtagaaatc 900 

ggtgccctgc tgctgactgg cggttacgaa atggacgcgc gcatttctaa actgtgcgaa 960 

cgtgctttcg ctaccggcct gccggtattt atggtgaaca ccaacacctg gcagacctct 1020 

ctgagcctgc agagcttcaa cctggaagtt ccggttgacg atcacgaacg tatcgagaaa 1080 

gttcaggaat acgttgctaa ctacatcaac gctgactgga tcgaatctct gactgccact 1140 

tctgagcgca gccgtcgtct gtctccgcct gcgttccgtt atcagctgac tgaacttgcg 1200 

cgcaaagcgg gcaaacgtat cgtactgccg gaaggtgacg aaccgcgtac cgttaaagca 12 60 

gccgctatct gtgctgaacg tggtatcgca acttgcgtac tgctgggtaa tccggcagag 1320 

atcaaccgtg ttgcagcgtc tcagggtgta gaactgggtg cagggattga aatcgttgat 1380 

ccagaagtgg ttcgcgaaag ctatgttggt cgtctggtcg aactgcgtaa gaacaaaggc 1440 

atgaccgaaa ccgttgcccg cgaacagctg gaagacaacg tggtgctcgg tacgctgatg 1500 

ctggaacagg atgaagttga tggtctggtt tccggtgctg ttcacactac cgcaaacacc 1560 

atccgtccgc cgctgcagct gatcaaaact gcaccgggca gctccctggt atcttccgtg 1620 

ttcttcatgc tgctgccgga acaggtttac gtttacggtg actgtgcgat caacccggat 1680 

ccgaccgctg aacagctggc agaaatcgcg attcagtccg ctgattccgc tgcggccttc 1740 

ggtatcgaac cgcgcgttgc tatgctctcc tactccaccg gtacttctgg tgcaggtagc 1800 

gacgtagaaa aagttcgcga agcaactcgt ctggcgcagg aaaaacgtcc tgacctgatg 1860 

atcgacggtc cgctgcagta cgacgctgcg gtaatggctg acgttgcgaa atccaaagcg 1920 

ccgaactctc cggttgcagg tcgcgctacc gtgttcatct tcccggatct gaacaccggt 1980 

aacaccacct acaaagcggt acagcgttct gccgacctga tctccatcgg gccgatgctg 2040 

cagggtatgc gcaagccggt taacgacctg tcccgtggcg cactggttga cgatatcgtc 2100 

tacaccatcg cgctgactgc gattcagtct gcacagcagc agtaa 2145 

<210> 94 
<211> 1767 
<212> DNA 
<213> E. Coli 

<400> 94 

atgaataatt ctattaacca taaatttcat cacattagcc gggctgaata ccaggaattg 60 

ttagccgttt cccgtggcga cgctgttgcc gattatatta ttgataatgt ctctattctc 120 

gacctgatca atggcggaga aatttccggc ccaattgtga ttaaaggacg ttacattgcc 180 

ggtgttggcg cagaatacac tgatgctccg gctttgcagc ggattgatgc tcgcggcgca 240 

acggcggtgc cagggtttat tgatgctcac ctgcatattg aatccagcat gatgacgccg 300 

gtcacttttg aaaccgctac cctgccgcgc ggcctgacga ccgttatttg cgaccctcat 360 

gaaatcgtca acgtgatggg cgaagccgga ttcgcctggt ttgcccgctg tgccgaacag 420 

gcaaggcaaa accagtactt acaggtcagc tcttgcgtac ccgccctgga aggctgcgat 480 

gttaacggtg ccagttttac ccttgaacag atgctcgcct ggcgggacca tccgcaggtt 540 

accggccttg cagaaatgat ggactaccct ggcgtaatta gcgggcagaa tgcgctgctc 600 

gataaactgg atgcatttcg ccacctgacg ctggacggtc actgcccggg tttgggtggt 660 

aaagaactta acgcctatat tactgcgggt attgaaaact gccacgaaag ttatcagctg 720 

gaagaaggac gccggaaatt acaactcggc atgtcgttga tgatccgcga agggtccgct 780 

gcccgcaatc tcaacgcgct ggcaccgttg atcaacgaat ttaacagccc gcaatgcatg 840 

ctctgtaccg atgaccgtaa cccgtgggag atcgcccatg aaggacacat cgatgcctta 900 
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attcgccgcc tgatcgaaca acacaatgtg ccgctgcatg tggcatatcg cgtcgccagc 960 

tggtcgacgg cgcgccactt tggtctgaat cacctcggct tactggcacc cggcaagcag 1020 

gccgatatcg tcctgttgag cgatgcgcgt aaggtcacgg tgcagcaggt actggtgaaa 1080 

ggcgagccga ttgatgcgca aaccttacag gcggaagagt cggcgagact ggcacaatcc 1140 

gctccgccat atggcaacac cattgcccgc cagccagttt ccgccagcga ctttgccctg 1200 

caatttacgc ccggaaaacg ctatcgggtc attgacgtca tccataacga attgattacg 1260 

cactcccact ccagcgtcta cagcgaaaat ggttttgatc gcgatgatgt gagctttatt 1320 

gccgtacttg agcgttacgg gcaacggctg gctccggctt gtggtttgct tggcggcttt 1380 

ggactgaatg aaggtgcgct ggctgcgacg gtcagccatg acagccataa tattgtggtg 1440 

atcggtcgca gtgccgaaga gatggcgctg gcggtcaatc aggtgattca ggatggcggc 1500 

gggctgtgcg tggtacgtaa cggccaggta caaagtcatc tgccgttacc cattgccggg 1560 

ctgatgagca ccgacacggc gcagtcgctg gcggaacaaa ttgacgcctt gaaagccgcc 1620 

gcccgtgaat gcggtccgtt acccgatgag ccgtttattc agatggcgtt tctttctctg 1680 

ccagtgatcc ccgcgctaaa actaaccagt caggggctat ttgatggcga gaagtttgcc 1740 

ttcactacgc tggaagtcac ggaataa 1767 

<210> 95 
<211> 1227 
<212> DNA 
<213> E. Coli 

<400> 95 

atggcgtatt gcaatccggg cctggaatcc aggccgaata agagaaacgc cctccggcgt 60 

catgtggtaa caggcatagg tatgaaaatc gtaatcgccc cagactctta taaagaaagt 120 

ttatctgcca gcgaggttgc gcaggcgata gaaaaaggat ttcgggaaat ttttcctgat 180 

gcacagtacg tttctgttcc ggttgccgac ggtggcgaag gaacggtgga agcgatgatt 240 

gcagccaccc agggggctga acgtcacgcc tgggttacag ggccgctggg cgagaaagtg 300 

aatgccagtt gggggatctc cggcgatggc aaaaccgcgt ttattgaaat ggcggcggcc 360 

agtgggctgg agctggtacc tgcggaaaaa cgcgatccac tcgtgaccac ttcacgcggc 420 

acaggcgagt taatcctgca ggcgctggag agcggtgcga caaacattat tatcggcatt 4 80 

ggcggcagcg ctacaaatga tggcggcgca ggcatggtac aggcgctggg ggcgaaatta 540 

tgcgacgcca acggcaatga aattggtttt ggcggcggta gtcttaatac tctgaatgat 600 

attgatattt ccggcctcga tccgcgctta aaagattgcg tcattcgcgt cgcttgtgat 660 

gtcaccaatc cgctggtggg cgataacggc gcatcgcgca tctttggccc acaaaaggga 720 

gccagtgaag cgatgattgt tgagctggac aataacctct ctcactatgc cgaggtcatt 780 

aaaaaagcgc tgcatgttga tgtgaaagat gtccccggtg caggagctgc gggtggtatg 840 

ggcgcggcgc taatggcgtt tcttggtgcg gaactgaaaa gtggtattga aatcgtcact 900 

acggcgctga atctggagga acatattcac gattgtacgc tggtgatcac cggtgaaggg 960 

cgtattgaca gccagagtat tcacgggaag gtaccgattg gtgtcgcaaa cgtggcgaag 1020 

aagtaccata aaccggtgat tggcattgcg ggtagcctga ccgatgatgt tggcgttgta 1080 

catcagcatg gcattgatgc ggtcttcagc gtattgacca gcataggtac gttggacgaa 1140 

gcattccgcg gggcttatga caatatctgc cgtgcttcac gtaatatcgc cgcgacactg 1200 

gcgattggaa tgcgcaacgc ggggtga 1227 

<210> 96 
<211> 900 
<212> DNA 
<213> E. Coli 

<400> 96 

atgattgata tgactatgaa agttggtttt attggcctgg ggattatggg taaaccaatg 60 

agtaaaaacc ttctgaaagc aggttactcg ctggtggttg ctgaccgtaa cccagaagct 120 

attgctgacg tgattgctgc aggtgcagaa acagcgtcta cggctaaagc gatcgctgaa 180 

cagtgcgacg tcatcataac catgctgcca aactcccctc atgtgaaaga ggtggcgctg 240 

ggtgagaatg gcattattga aggcgcgaag ccaggtacgg tattgatcga tatgagttct 300 

atcgcaccgc tggcaagccg tgaaatcagc gaagcgctga aagcgaaagg cattgatatg 360 

ctggatgctc cggtgagcgg cggtgaaccg aaagccatcg acggtacgct gtcagtgatg 420 



-30- 



gtgggcggcg acaaggctat tttcgacaaa 
tccgtggtgc ataccgggga aatcggtgca 
attgtggcgc tgaatattgc cgcgatgtca 
gttaacccgg acctggttta tcaggcaatt 
gatgccaaag cgccgatggt gatggaccgc 
catattaagg atctggcgaa tgcgctggat 
ctcacagctg cggttatgga gatgatgcag 
gatcatagcg ccctggcgtg ctactacgaa 

<210> 97 

<211> 771 

<212> DNA 

<213> E. Coli 

<400> 97 

atgaataacg atgttttccc gaataaattc aaagccgcac tggctgcgaa acaggtacaa 60 

attggttgct ggtcagcact ctctaacccg attagcactg aagttcttgg tttggctggg 120 

tttgactggc tggtgctgga tggcgaacat gcgccaaacg atatctccac gtttattccg 180 

cagttaatgg ccttgaaagg cagcgccagc gcgccagtag tgcgagtgcc gaccaacgag 240 

ccggtaatta ttaagcgtct tctggatatc ggtttctata acttcctgat tccttttgta 300 

gaaacaaaag aggaagcaga gctggcggtg gcatcaaccc gttacccacc ggaaggcatt 360 

cgcggcgtct ccgtttctca ccgcgccaat atgtttggca ccgtggcgga ttatttcgct 420 

cagtcgaaca agaacatcac tattctggtc cagatagaaa gtcagcaggg cgtagataac 480 

gtcgatgcca ttgccgctac cgaaggcgta gacggcatct tcgtcggccc cagcgatctg 540 

gccgcggcat taggccatct cggcaatgca tcacacccgg atgtacaaaa agcaattcag 600 

cacattttta accgtgccag cgcgcacggc aaacccagcg gtatcctcgc gccggtcgaa 660 

gccgatgcgc gtcgttatct ggaatggggc gcgacgtttg tggctgtcgg cagcgatctc 720 

ggcgtcttcc gctctgccac tcagaaactg gctgatacct ttaaaaaata a 771 

<210> 98 

<211> 1335 

<212> DNA 

<213> E. Coli 

<400> 98 

atgattctgg acaccgttga cgaaaaaaag aaaggcgtgc atacccgcta tttaatatta 60 

ctgattattt ttattgttac cgccgttaac tacgccgatc gtgcaacgct gtctattgct 120 

ggtaccgaag tggcaaaaga gttgcagtta agtgcggttt cgatgggtta catcttctcc 180 

gcttttggct gggcctactt gctgatgcaa atccccggcg gctggctgct tgataagttt 240 

ggctcgaaaa aagtttacac ctacagcctc tttttctggt cgctattcac cttcctgcaa 300 

ggctttgttg atatgttccc gctggcctgg gcagggatct ccatgttctt tatgcgcttt 360 

atgctcggct tctcggaagc gccatcattc ccggcgaacg cccgaattgt cgccgcctgg 420 

ttcccgacga aagaacgtgg tactgcctcc gccatcttta actcggcgca atatttctcg 480 

ctggcgctct tttcgccgct gcttggctgg ctgactttcg cctggggctg ggagcacgtc 540 

tttaccgtta tgggggtgat tggttttgtg ctgacggcgc tgtggatcaa gttgattcat 600 

aacccgacag atcacccacg tatgtctgcg gaagagctga agtttatctc tgaaaatggc 660 

gcggtggtcg atatggacca caaaaagccg ggcagtgcgg cagcaagcgg acccaaactg 720 

cattacatca agcaattgct ctctaaccgc atgatgctgg gcgtattttt cggacaatat 780 

tttatcaaca ccatcacctg gttcttcctc acctggttcc cgatttatct ggtgcaggaa 840 

aaaggcatgt cgattctgaa agtgggtctg gtcgcctcga ttccagcact gtgtggtttt 900 

gcgggcggcg tgctgggagg tgtcttctcg gattatctga tcaaacgcgg tttatccctg 960 

accctggcac gtaagctacc gattgtgctg ggaatgttgc tggcttccac catcatctta 1020 

tgtaactaca ccaacaacac cacgctggtg gtcatgctga tggcgctggc tttctttggc 1080 

aaaggatttg gtgcgctggg ctggccggtg atttctgaca ccgcgccgaa agagattgtt 1140 

ggcctctgcg gcggcgtctt taacgtcttt ggcaatgttg cctccattgt cactccactg 1200 

gtgattggct acctggtaag tgaactgcac tccttcaatg cagcactggt tttcgtggga 1260 

tgttcagcgc tgatggcgat ggtctgctac ctcttcgtag ttggcgacat taaacgtatg 1320 



tactatgatt tgatgaaagc 
ggtaacgtca ccaaactggc 
gaagcgttaa cgctggcaac 
cgcggtggac tggcgggcag 
aacttcaagc cgggcttccg 
acttctcacg gcgtcggcgc 
gcactgcgag cagatggttt 
aaactggcga aagtcgaagt 



gatggcgggt 4 80 

aaatcaggtc 540 

taaagcgggc 600 

taccgtgctg 660 

tattgatctg 720 

acaactgccg 780 

aggaacggcg 84 0 

tactcgttaa 900 
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gaattgcaga aataa 



1335 



<210> 99 
<211> 1536 
<212> DNA 
<213> E. Coli 

<400> 99 

atgcaaacga gtgatacccg cgcgttaccg ctactttgcg cccgctcggt ttataaacag 60 

tattcagggg tcaatgtcct gaaaggcatc gattttacgt tgcatcaggg ggaggtccac 120 

gccctgctcg gcggcaatgg tgccggtaaa tcgacgttaa tgaagattat tgccggtatt 180 

acccctgctg atagcggtac gctggagatt gagggcaaca actacgtcag attaacgcca 240 

gttcatgctc atcagctggg tatttatctc gttccccagg aaccgctgct tttcccaagc 300 

ctgtcgataa aagaaaacat cctgtttggg ctggcaaaaa aacagctctc catgcagaaa 360 

atgaagaact tgctggcggc gctgggctgc cagtttgatc tgcatagtct ggcaggatcg 420 

ctggatgtcg ccgatcgcca aatggtggaa atcctccgcg ggctgatgcg cgactcgcgg 480 

attctgatcc tcgatgaacc taccgcctcg cttacccctg cggaaaccga acgcttgttt 540 

agtcgcttgc aagagctgct tgctactggc gtgggtattg tttttatctc gcataagctg 600 

ccggaaattc gccagattgc cgatcgaatt agcgtgatgc gcgacggaac catcgcctta 660 

agcggcaaaa ccagcgaact gtctaccgac gacattattc aggccatcac cccagcggta 720 

cgggaaaaat cgctctctgc cagccaaaaa ttatggctgg agttacctgg taaccgccca 780 

caacatgccg ccggaacgcc ggtgctgaca ctggaaaatc tgaccggcga aggtttcagg 840 

aatgtcagcc tgacgctcaa tgccggagaa attctgggcc tggctgggct ggtgggggcc 900 

ggacgcacag aactggccga gacgctctat ggtctgcgta ctttgcgtgg cggacgcatt 960 

atgctgaatg gtaaagagat caataaatta tccactggag aacgtttact gcgcggtctg 1020 

gtttatctgc cggaagatcg ccagtcatcc ggactgaatc tcgatgcttc gctggcctgg 1080 

aacgtctgcg cccttactca taaccttcgt ggattctggg cgaaaaccgc gaaagataat 1140 

gccaccctgg aacgttatcg tcgggcgctg aatattaaat tcaaccaacc ggaacaagct 1200 

gcacggacat tatccggtgg caaccagcaa aaaatcctca ttgccaaatg cttggaagct 1260 

tcgccgcaag tattgattgt cgatgagccg acgcgcggcg tggatgtctc ggcccgtaat 1320 

gatatctacc agctgttgcg cagcatcgcc gcacaaaatg tggctgtgct gcttatctcc 1380 

tccgacctgg aagagatcga actgatggca gatcgtgtgt atgtgatgca tcagggcgaa 1440 

attacccact ctgcactgac cgagcgcgat attaatgtcg agactattat gcgcgttgcc 1500 

ttcggcgata gtcagcgtca ggaggcgtca tgctga 1536 

<210> 100 
<211> 1029 
<212> DNA 
<213> E. Coli 



<400> 100 



atgctgaagt 
tttgtattac 
tatagcagcg 
cgcaatattg 
ttactgaacg 
ctcgcgggat 
acccttggca 
attgaagggt 
gcaattggtt 
gcgtttggac 
gttcgtactg 
gcgggaattg 
gagatgaaag 
gcgatcattg 
ctgttgcgca 
ctggtgtttg 



ttattcagaa 
ccggttttct 
cgcaaatcct 
atgtttcagt 
caggatattc 
ttttcaacgg 
cgttagggtt 
tacccgccga 
ggttgacgat 
gcagttttta 
aagccattcg 
tgtttgcttc 
caattgcagc 
gtgcggtact 
ttccggcatg 
atggacgcct 



caaccgtgaa 
cgaccgccag 
gatcctgctg 
gggttcgatt 
actacctgtt 
tgtcctggtc 
gtacagaggc 
actgaaacag 
aattctggtg 
tgccacgggc 
cattgtggca 
gcagattggt 
ctgcgtgctg 
cggcgcatgg 
gtggaatgat 
gcgttgtgcg 



atcacggcac tgctggcggt ggtgctgctg 
tatttaagtg tgcaaacgct gaccatggtt 
gcaatgggcg cgacgctggt aatgcttacg 
accggaatgt gcgcggtgct gttggggatg 
gcttgtgtcg cgactttact gcttggtttg 
gcgtggctaa agatccctgc cattgttgcc 
atcatgttgc tgtggactgg cggcaaatgg 
ctctccgccc cgctgctgct tggcgtttca 
gcatttatgg cctggctgct ggcaaagacg 
gataatttac agggcgctcg tcaactgggc 
ttttcgttga acggctgcat ggcggcactg 
tttatcccca accagaccgg taccgggctg 
ggcggcatta gtttgctcgg tggttccggt 
ttcctgacgc agatcgatag cgtactggtg 
tttatcgcgg gtctggttct gctggcggtg 
ctggaacgta atctacggcg gcaaaaatat 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
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gcccgcttta tgacgccacc gccatccgtt aaacccgctt cgtcaggtaa aaaacgggag 
gccgcataa 



1020 
1029 



<210> 101 
<211> 993 
<212> DNA 
<213> E. Coli 

<400> 101 

atgcgtattc gctacggttg ggaactggct cttgccgcac tgctcgttat tgagattgtc 60 

gcatttggtg caattaaccc gcgaatgtta gatctcaata tgttgctgtt cagcaccagt 120 

gactttatct gcattggcat tgtcgcccta ccgctaacga tggtgattgt cagtggcggg 180 

atcgatattt cgtttggttc gaccatcggc ctctgcgcca ttgcattggg cgtactgttt 240 

caaagtggtg tgccgatgcc gctggcgata ctcctgacct tactgctcgg cgcattgtgc 300 

gggctgatca acgccggatt aattatctat accaaagtta acccgctggt gattacgctt 360 

ggcacgctgt atctgtttgc cggaagcgct ctgctgcttt ccggtatggc cggagcgacg 420 

gggtacgaag gtattggtgg attcccgatg gcgtttacag atttcgctaa cctggatgtg 480 

ctgggactcc ccgttccgct gattatcttc ctgatatgtc tcctcgtttt ctggctctgg 540 

ctgcataaaa cccatgccgg acgtaatgtg tttttgattg ggcaaagccc gcgcgtggcg 600 

ctttatagcg cgattccagt taaccgtacc ttatgtgcgc tctatgccat gacggggctg 660 

gcgtctgcgg tcgccgctgt gctgctggta tcgtattttg gttcagcacg ttccgatctc 720 

ggtgcgtcgt ttctgatgcc cgccatcacc gccgtggtgc ttggcggggc caatatttat 780 

ggtggttccg gttccattat cggcaccgcc attgcggttt tattagtggg atatttgcaa 840 

caaggtttgc aaatggcagg agtgccaaat caggtgtcca gcgccctttc cggtgcgcta 900 

cttatcgtcg ttgtcgtagg tcgttccgtt agcctgcatc gccagcaaat taaagagtgg 960 

ctggcgcgtc gggccaataa cccattgcca taa 993 

<210> 102 
<211> 1023 
<212> DNA 
<213> E. Coli 

<400> 102 

atgacacttc atcgctttaa gaaaatcgcc ttacttagcg ctcttggcat tgccgcaatc 60 

tctatgaatg tgcaggccgc agagcgtatt gcatttattc ccaaactggt tggcgtggga 120 

ttttttacca gcggtggcaa cggcgcacaa caagcgggta aagagctggg cgttgatgtg 180 

acctacgacg ggccgacaga acccagtgtt tctggtcagg tacagttgat taataacttc 240 

gtcaatcaag gttataacgc cattatcgtt tctgcggttt cgcctgatgg cttgtgtccg 300 

gcactgaaac gcgccatgca acgtggtgtg agagtgctga cctgggactc tgatactaaa 360 

ccggagtgcc gctcttacta cattaatcag ggaacgcccg cccagttagg aggtatgttg 420 

gtggatatgg cggcgcgtca ggtgaataaa gacaaagcca aagtcgcgtt tttctactca 480 

agccccaccg ttacggacca aaaccagtgg gtgaaagaag cgaaagcgaa aatcgccaaa 540 

gagcatccag gctgggaaat tgtcactacg cagtttggct ataacgatgc cactaaatcg 600 

ttacaaaccg cagaaggaat attaaaagcg tatagcgatc tcgacgccat tatcgccccc 660 

gatgccaacg ccctgcccgc tgccgcacaa gccgcagaaa acttgaaaaa tgacaaagta 720 

gcgattgtcg gattcagtac gccaaatgtg atgcgcccgt atgtagagcg cggcacggtg 780 

aaagaatttg gcctgtggga tgtggttcag caaggcaaaa tttcagtgta tgtcgcggat 840 

gcattattga aaaaaggatc aatgaaaacg ggcgacaagc tggatatcaa gggcgtaggt 900 

caggttgaag tctcgccaaa cagcgttcag ggctatgact acgaagcgga tggtaatggc 960 

atcgtactgt taccggagcg cgtgatattc aacaaagaga atatcggcaa atacgatttc 1020 

1023 

<210> 103 
<211> 876 
<212> DNA 
<213> E. Coli 
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<400> 103 

atggcagatt tagacgatat taaagatggt aaagattttc gtaccgatca accgcaaaaa 60 

aatatccctt ttaccctgaa aggttgcggt gcgctggatt ggggaatgca gtcacgctta 120 

tcgcggatat ttaatccgaa aacgggtaaa accgtgatgc tggcttttga ccatggttat 180 

tttcagggac cgactaccgg acttgaacgc attgatataa atatcgcccc gctgtttgaa 240 

catgccgatg tattaatgtg tacgcgcggc attttgcgca gcgtagttcc ccctgcgacc 300 

aataggccgg tggtactgcg ggcgtcaggt gcgaactcta ttctggcgga attaagtaat 360 

gaagccgtgg cgttatcgat ggatgacgcc gtgcgcctga acagttgcgc ggtggcggcg 420 

caggtttata tcggcagcga atatgaacat cagtcgatca aaaatattat tcagctggtt 480 

gatgccggaa tgaaagtggg aatgccgacc atggccgtga ctggcgtggg caaagatatg 540 

gtgcgcgatc agcgttattt ctcgctcgcg actcgaatcg ccgctgaaat gggggcgcaa 600 

attatcaaaa cctattatgt cgaaaaaggt tttgaacgga ttgttgccgg atgtccggta 660 

cccattgtta ttgctggcgg taaaaaatta ccggagcgcg aggcgctgga aatgtgctgg 720 

caggctatcg atcagggcgc ttctggtgtg gatatggggc gtaatatttt ccagtctgac 780 

catccggtgg cgatgatgaa agccgtacag gcggtggttc accataacga aacggctgat 840 

cgggcatatg aactctatct gagtgaaaaa cagtaa 876 

<210> 104 
<211> 291 
<212> DNA 
<213> E. Coli 

<400> 104 

atgcacgtca cactggttga aattaacgtt catgaagaca aggttgacga gtttatcgaa 60 

gtttttcgcc agaaccacct gggctctgta caggaagaag gcaatttgcg cttcgatgtc 120 

ttacaggacc cggaagtgaa ttcgcgcttt tatatctacg aagcctataa agatgaagac 180 

gcagtggcgt tccataaaac cacgccccac tacaaaacct gtgtcgcgaa actggaatct 240 

ttaatgaccg ggccgcgtaa aaaacgtctg ttcaatggtt tgatgccgtg a 291 

<210> 105 
<211> 1152 
<212> DNA 
<213> E. Coli 

<400> 105 

atgtttgaac caatggaact taccaatgac gcggtgatta aagtcatcgg cgtcggcggc 60 

ggcggcggta atgctgttga acacatggtg cgcgagcgca ttgaaggtgt tgaattcttc 120 

gcggtaaata ccgatgcaca agcgctgcgt aaaacagcgg ttggacagac gattcaaatc 180 

ggtagcggta tcaccaaagg actgggcgct ggcgctaatc cagaagttgg ccgcaatgcg 240 

gctgatgagg atcgcgatgc attgcgtgcg gcgctggaag gtgcagacat ggtctttatt 300 

gctgcgggta tgggtggtgg taccggtaca ggtgcagcac cagtcgtcgc tgaagtggca 360 

aaagatttgg gtatcctgac cgttgctgtc gtcactaagc ctttcaactt tgaaggcaag 420 

aagcgtatgg cattcgcgga gcaggggatc actgaactgt ccaagcatgt ggactctctg 480 

atcactatcc cgaacgacaa actgctgaaa gttctgggcc gcggtatctc cctgctggat 540 

gcgtttggcg cagcgaacga tgtactgaaa ggcgctgtgc aaggtatcgc tgaactgatt 600 

actcgtccgg gtttgatgaa cgtggacttt gcagacgtac gcaccgtaat gtctgagatg 660 

ggctacgcaa tgatgggttc tggcgtggcg agcggtgaag accgtgcgga agaagctgct 720 

gaaatggcta tctcttctcc gctgctggaa gatatcgacc tgtctggcgc gcgcggcgtg 780 

ctggttaaca tcacggcggg cttcgacctg cgtctggatg agttcgaaac ggtaggtaac 840 

accatccgtg catttgcttc cgacaacgcg actgtggtta tcggtacttc tcttgacccg 900 

gatatgaatg acgagctgcg cgtaaccgtt gttgcgacag gtatcggcat ggacaaacgt 960 

cctgaaatca ctctggtgac caataagcag gttcagcagc cagtgatgga tcgctaccag 1020 

cagcatggga tggctccgct gacccaggag cagaagccgg ttgctaaagt cgtgaatgac 1080 

aatgcgccgc aaactgcgaa agagccggat tatctggata tcccagcatt cctgcgtaag 1140 

caagctgatt aa 1152 

<210> 106 



"34- 



<211> 3048 
<212> DNA 
<213> E. Coli 



<400> 106 

atggacgtca gtcgcagaca attttttaaa atctgcgcgg gcggtatggc tggaacaaca 60 

gtagcggcat tgggctttgc cccgaagcaa gcactggctc aggcgcgaaa ctacaaatta 120 

ttacgcgcta aagagatccg taacacctgc acatactgtt ccgtaggttg cgggctattg 180 

atgtatagcc tgggtgatgg cgcaaaaaac gccagagaag cgatttatca cattgaaggt 24 0 

gacccggatc atccggtaag ccgtggtgcg ctgtgcccga aaggggccgg tttgctggat 300 

tacgtcaaca gtgaaaaccg tctgcgctac ccggaatatc gtgcgccagg ttctgacaaa 360 

tggcagcgca ttagctggga agaagcattc tcccgtattg cgaagctgat gaaagctgac 420 

cgtgacgcta actttattga aaagaacgag cagggcgtaa cggtaaaccg ttggctttct 480 

accggtatgc tgtgtgcctc cggtgccagc aacgaaaccg ggatgctgac ccagaaattt 54 0 

gcccgctccc tcgggatgct ggcggtagac aaccaggcgc gcgtctgaca cggaccaacg 600 

gtagcaagtc ttgctccaac atttggtcgc ggtgcgatga ccaaccactg ggtggatatc 660 

aaaaacgcta acgtcgtgat ggtgatgggc ggtaacgctg ctgaagcgca tcccgtcggt 720 

ttccgctggg cgatggaagc gaaaaacaac aacgacgcaa ccttgatcgt tgtcgatccc 780 

cgttttacgc gtaccgcttc tgtggcggat atttacgcgc ctattcgttc cggtacggac 840 

attacgttcc tgtctggcgt tttgcgctac ctgatcgaaa acaacaaaat caacgccgaa 900 

tacgttaagc attacaccaa cgccagcctg ctggtgcgtg atgattttgc tttcgaagac 960 

ggtctgttca gcggctacga cgctgaaaaa cgtcaatacg ataaatcgtc ctggaactat 1020 

cagctcgatg aaaacggcta tgcgaaacgc gatgaaacac tgactcatcc gcgctgtgtg 1080 

tggaacctgc tgaaagagca cgtttcccgc tacacgccgg acgtcgttga aaacatctgc 1140 

ggtacgccaa aagccgactt cctgaaagtg tgtgaagtgc tggcctccac cagcgcaccg 1200 

gatcgcacaa ccaccttcct gtacgcgctg ggctggacgc agcacactgt gggtgcgcag 12 60 

aacatccgta ctatggcgat gatccagttg ctgctcggta acatgggtat ggccggtggc 1320 

ggcgtgaacg cattgcgtgg tcactccaac attcagggct tgactgactt aggcctgctc 1380 

tctaccagcc tgccaggtta tctgacgctg ccgtcagaaa aacaggttga tttgcagtcg 1440 

tatctggaag cgaacacgcc gaaagcgacg ctggctgatc aggtgaacta ctggagcaac 1500 

tatccgaagt tcttcgttag cctgatgaaa tctttctatg gcgatgccgc gcagaaagag 1560 

aacaactggg gctatgactg gctgccgaag tgggaccaga cctacgacgt catcaagtat 1620 

ttcaacatga tggatgaagg caaagtcacc ggttatttct gccagggctt taacccggtt 1680 

gcgtccttcc cggacaaaaa caaagtggtg agctgcctga gcaagctgaa gtacatggtg 1740 

gttatcgatc cgctggtgac tgaaacctct accttctggc agaaccacgg tgagtcgaac 1800 

gatgtcgatc cggcgtctat tcagactgaa gtattccgtc tgccttcgac ctgctttgct 1860 

gaagaagatg gttctatcgc taactccggt cgctggttgc agtggcactg gaaaggtcag 1920 

gacgcgccgg gcgaagcgcg taacgacggt gaaattctgg cgggtatcta ccatcatctg 1980 

cgcgagctgt accagtccga aggtggtaaa ggcgtagaac cgctgatgaa gatgagctgg 2040 

aactacaagc agccgcacga accgcaatct gacgaagtgg ctaaagagaa caacggctac 2100 

gcgctggaag atctctatga cgctaatggc gtgcttattg cgaagaaagg tcagttgctg 2160 

agtagctttg cgcatctgcg tgatgacggt acaaccgcat cttcttgctg gatctacacc 2220 

ggtagctgga cagagcaggg caaccagatg gctaaccgcg ataactccga cccgtccggt 2280 

ctggggaata cgctgggatg ggcctgggcg tggccgctca accgtcgcgt gctgtacaac 2340 

cgtgcttcgg cggatatcaa cggtaaaccg tgggatccga aacggatgct gatccagtgg 2400 

aacggcagca agtggacggg taacgatatt cctgacttcg gcaatgccgc accgggtacg 24 60 

ccaaccgggc cgtttatcat gcagccggaa gggatgggac gcctgtttgc catcaacaaa 2520 

atggcggaag gtccgttccc ggaacactac gagccgattg aaacgccgct gggcactaac 2580 

ccgctgcatc cgaacgtggt gtctaacccg gttgttcgtc tgtatgaaca agacgcgctg 2640 

cggatgggta aaaaagagca gttcccgtat gtgggtacga cctatcgtct gaccgagcac 2700 

ttccacacct ggaccaagca cgcattgctc aacgcaattg ctcagccgga acagtttgtg 2760 

gaaatcagcg aaacgctggc ggcggcgaaa ggcattaata atggcgatcg tgtcactgtc 2820 

tccagcaagc gtggctttat ccgcgcggtg gctgtggtaa cgcgtcgtct gaaaccgctg 2880 

aatgtaaatg gtcagcaggt tgaaacggtg ggtattc6aa tccactgggg ctttgagggt 2940 

gtcgcgcgta aaggttatat cgctaacact ctgacgccga atgtcggtga tgcaaactcg 3000 

caaacgccgg aatataaagc gttcttagtc aacatcgaga aggcgtaa 3048 
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<210> 107 
<211> 885 
<212> DNA 
<213> E. Coli 



<400> 107 

atggctatgg aaacgcagga cattatcaaa aggtccgcaa ctaactccat cacgccgcct 60 

tctcaggtgc gtgattacaa agcagaagtc gcaaaactta tcgacgtttc cacctgtatc 120 

ggctgtaaag cctgtcaggt ggcgtgttcg gagtggaacg acatccgtga tgaagtgggg 180 

cactgcgtcg gggtttacga taaccccgcc gatctgagcg ccaagtcctg gacggtgatg 240 

cgctttagcg aaaccgaaca gaacggcaag ctggagtggc tgatccgtaa agacggctgt 300 

atgcactgtg aagatcccgg ctgcctgaag gcgtgcccgt ctgctggtgc aatcattcag 360 

tacgctaacg ggattgtcga tttccagtcg gaaaactgca tcggctgtgg ttactgcatt 420 

gccgggtgtc cgtttaatat tccgcgcctc aacaaagagg ataaccgggt atataaatgc 480 

acgctctgcg tcgatcgcgt cagcgtcggc caggaaccgg cttgtgtgaa aacctgtccg 54 0 

accggggcta tccacttcgg caccaagaag gagatgctgg agctggcgga acagcgcgtg 600 

gcgaaactga aagcgcgtgg ttacgaacat gctggcgtct acaacccgga aggggtcggt 660 

ggtacgcacg ttatgtacgt gctgcatcac gccgatcagc cggagctgta tcacggtctg 720 

ccgaaagatc cgaagatcga cacctcggta agcctgtgga aaggcgcgtt gaaaccgctg 780 

gcagcggctg gctttattgc cacttttgcc gggttgattt tccactacat cggtattggc 840 

ccgaataagg aagtggacga tgacgaggag gatcatcatg agtaa 885 

<210> 108 
<211> 654 
<212> DNA 
<213> E. Coli 



<400> 108 

atgagtaagt cgaaaatgat tgtgcgcacc aaatttattg atcgcgcctg tcactggacc 60 

gtggtgattt gcttcttcct ggtggcgctg tccgggattt cgttcttctt cccgacgctg 120 

caatggctga cgcaaacctt cggtacgccg cagatgggac gcattttgca cccgttcttc 180 

ggcattgcga ttttcgtcgc actgatgttt atgtttgtgc gttttgtgca tcacaacatc 240 

ccggataaga aagatattcc gtggctgttg aacattgtcg aagtattgaa aggcaatgag 300 

cataaagtgg cggatgtcgg taagtacaac gccgggcaaa agatgatgtt ctggtcgatc 360 

atgagcatga ttttcgtgct gctggtgacc ggggtgatta tctggcgtcc gtactttgcg 420 

cagtacttcc cgatgcaggt tgttcgctac agcctgctga tccacgcggc tgcgggtatc 480 

atcctgatcc acgccatcct gatccatatg tatatggcat tttgggtgaa aggatcgatt 540 

aaagggatga tcgaagggaa ggtaagtcgt cgctgggcga agaaacacca tccgcgctgg 600 

tatcgtgaaa tcgagaaggc agaagcgaaa aaagagagtg aagaagggat ataa 654 

<210> 109 
<211> 261 
<212> DNA 
<213> E. Coli 



<400> 109 

atggcgttgt taatcactaa aaaatgcatc aattgtgata tgtgtgaacc cgaatgcccg 60 

aatgaggcga tttcaatggg agatcatatc tacgagatta acagcgataa gtgtaccgaa 120 

tgcgtagggc actacgagac accaacctgc cagaaggtgt gcccgatccc caatactatt 180 

gtgaaagatc cggcgcatgt cgagacagaa gaacagttgt gggataaatt tgtgctgatg 240 

caccacgcgg ataaaattta a 261 

<210> 110 
<211> 1203 
<212> DNA 
<213> E. Coli 
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<400> 110 

atgcaaagtg ttgatgtagc cattgttggc ggcggcatgg tggggctggc ggttgcctgt 60 

ggcttacagg ggagcggctt acgcgttgcc gtactggagc agcgcgtaca ggaacctctg 120 

gcggcgaatg caccaccaca actgcgcgtt tcggctatca atgccgccag cgaaaaatta 180 

ctcacccgtc ttggcgtctg gcaggacatt ctctctcgta gggccagctg ttatcacggt 240 

atggaagtgt gggacaaaga cagctttggt cacatttcgt ttgacgatca aagcatgggc 300 

tatagccatc ttgggcatat cgttgaaaat tcagtgattc actacgcgct gtggaacaaa 360 

gcgcatcagt cgtcagatat cactctgtta gcccccgcag aattacagca ggtcgcctgg 420 

ggagaaaatg aaaccttcct gacgctgaaa gatggcagca tgttaacggc gcgtctggtg 480 

attggcgcgg acggcgctaa ttcctggttg cgcaacaaag ccgatattcc gctgactttc 540 

tgggattatc agcatcacgc gctggtagcg accattcgca cggaagaacc gcatgatgcg 600 

gtggcgcggc aggttttcca tggcgaaggc attctggcct ttttaccgct tagcgatccg 660 

catctttgct cgattgtctg gtcactgtcg ccagaggaag cgcagcggat gcagcaggca 720 

agtgaagacg aatttaatcg cgcgttaaat atcgcttttg ataatcgcct gggcttatgc 780 

aaggttgaga gcgcgcgtca ggtgttccca ctgacggggc gttatgcgcg ccagtttgcc 840 

tcgcaccgtc tggcgctggt gggcgacgcc gcacatacca ttcacccgct ggcggggcag 900 

ggggtaaatc tcggctttat ggatgctgca gagctgattg ccgaactgaa acggttgcat 960 

cgtcagggga aagacatcgg gcagtacatt tatctgcgtc gctatgagcg tagccgcaag 1020 

cacagtgcgg cgttgatgct ggctggtatg cagggattcc gcgatctgtt ttccggtacc 1080 

aatccggcga aaaaactgct gcgtgatatt ggtttgaaac tggccgacac gcttcctggc 1140 

gttaagccgc aacttatccg ccaggcaatg ggattaaacg atttgcctga atggctgcgt 1200 

1203 

<210> 111 
<211> 1179 
<212> DNA 
<213> E. Coli 

<400> 111 

atgagcgtaa tcatcgtcgg tggcggcatg gcgggcgcga cgctggcgct ggctatttcc 60 

cggttaagtc acggggcgct gccggtacat ttgattgaag cgactgcgcc agagtcacat 120 

gctcatccgg gctttgatgg acgagcgata gcgctggcgg cgggtacctg tcagcaactg 180 

gcgcgcatcg gcgtctggca atctctggcg gattgcgcaa ctgccatcac caccgtgcat 240 

gtcagcgatc gtggtcacgc tggatttgtc accctcgccg cagaagatta ccaactggcg 300 

gcgctgggac aggttgtcga attgcacaat gtcgggcaac ggctgtttgc attgctgcgt 360 

aaagcacctg gcgtaacgct gcattgccct gatcgcgtgg ctaacgttgc ccgtactcag 420 

agtcacgttg aagtgacgct ggagagtggc gagacgctga cgggccgcgt gctggtagca 4 80 

gctgatggca cccattcagc gttagccacc gcgtgcggcg ttgactggca gcaggagcct 540 

tacgaacaac tggccgtgat tgccaacgtt gctacttccg ttgcgcatga agggcgcgct 600 

tttgaacgct ttacgcaaca tggcccgctg gcgatgttgc cgatgtctga cggacgctgt 660 

tcgctggtct ggtgtcatcc actggaacgg cgcgaagagg tgttgtcgtg gagtgacgag 720 

aagttttgcc gtgaactcca gtcggccttt ggctggcgac ttgggaaaat tacccacgct 780 

ggtaaacgca gtgcttatcc gctggcgtta acccacgccg ccagatctat tacccatcgt 840 

accgtgctgg tgggcaatgc ggcgcaaact ctgcacccga ttgccgggca agggtttaac 900 

ctcggtatgc gagatgtgat gagtcttgcg gaaaccctga ctcaggcgca ggagcgcgga 960 

gaagacatgg gggattacgg cgtattgtgc cgttatcagc agcgtcgaca gagcgatcgc 1020 

gaagcaacca ttggcgtcac ggacagcctt gtacatcttt ttgccaaccg ttgggcaccg 1080 

ctggttgtcg ggcgcaacat cgggctgatg acgatggaat tattcacccc ggcacgcgat 1140 

gtgctggcgc agcgcaccct cggttgggtg gcgcgttga 1179 

<210> 112 
<211> 1326 
<212> DNA 
<213> E. Coli 

<400> 112 

atgagtgaga tatcccggca agagtttcag cgtcgccgtc aggccctggt ggagcaaatg 60 
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caacccggca 
gaatacccct 
gcggtgctgg 
gttcgcgacc 
gagaaactgg 
ctacttaacg 
atcgtgaaca 
gcaacgatga 
gagattgccg 
gaaaaatgcc 
aaccgccacg 
tgcattctgc 
gacgcgggtt 
ggcaaattca 
agcctgcgcc 
atcatggtta 
gctcagaacg 
gtccatgacg 
ctgaccgtag 
ggtatcggca 
accgccagcg 
caatga 



gcgccgcgct 
atcgtcagaa 
tgctgattaa 
tgacggcgga 
gcgttgaccg 
gcctggatgt 
gtgcgctgga 
tcgactggcg 
tactccgccg 
gtccgggaat 
gtgcgcgcta 
actacaccga 
gtgaatacaa 
cccaggccca 
tgtatcgtcc 
gcggcctggt 
cccatcgtcc 
tgggtgttta 
agccagggct 
ttcgtattga 
tggtgaaaaa 



gatttttgct 
cagtgacttc 
aagcgatgac 
gatctggttt 
cgcactggca 
ggtttaccat 
aaaactgcgt 
tcctgttgtt 
cgcgggagaa 
gttcgagtac 
tccgtcctat 
aaacgagtgt 
aggttacgct 
gcgtgaaatc 
gggaacttcc 
aaaactcggc 
tttctttatg 
tggtcaggat 
gtatattgcg 
agacgacatt 
gccggaagaa 



gcaccagaag 
tggtacttca 
actcataacc 
ggccgtcgct 
ttcagcgaaa 
gcccagggcg 
aaaggttcgc 
catgaaatgc 
atcaccgcca 
catctggaag 
aacaccattg 
gaaatgcgcg 
ggcgatatta 
tacgacattg 
attctggaag 
atcctgaaag 
catggcctta 
cgctcgcgca 
ccggatgcag 
gtgattaccg 
atcgaagcgt 



taacacgtag 
ccggctttaa 
acagcgttct 
taggccagga 
tcaatcagca 
aatatgcata 
ggcaaaatct 
gcctgttcaa 
tggcacatac 
gcgaaattca 
tcggcagcgg 
acggcgacct 
cccgcacctt 
tgctggagtc 
tcactggtga 
gtgatgttga 
gccactggtt 
ttctggaacc 
aagtgccaga 
aaaccggtaa 
tgatggttgc 



cgccgacagc 
cgaaccggaa 
gtttaaccgc 
tgccgcgcca 
actttatcaa 
tgctgatgta 
caccgcaccg 
atcgccagaa 
acgggcgatg 
ccacgaattt 
tgaaaacggc 
ggtgttgatt 
cccggtcaac 
tctcgaaacc 
agtggtgcgc 
tgaactgatc 
aggactggat 
gggcatggta 
acaatatcgc 
cgaaaacctc 
tgcgagaaag 



<210> 113 
<211> 585 
<212> DNA 
<213> E. Coll 



<400 
atgcttatgt 
caacaaggga 
ggcggtaacg 
gctttcggtc 
ctgcaggatg 
ttcgatcggg 
acgcaaccga 
aacattgcgc 
gaagagatca 
ccgcaaccga 



> 113 
ctatacagaa 
cgggtctgac 
atgacagctc 
atgagctggc 
acggcttcct 
ctgatgcatt 
agctggataa 
aactgggtta 
tcgaatacgt 
ccgcgccaga 



cgaaatgcct 
cccagctgag 
atggctaccg 
acaggcactg 
ttttcagctt 
ggcaggttgg 
agtgaccggc 
cgacgaagac 
tcgtgttgcc 
agtacaaaaa 



ggttacaacg 
atgcatggtt 
ctacttcacg 
cgtaaaatgc 
tatctgcctg 
gtcaatcact 
gaaaccggtg 
gaagatcagg 
gcgctgttat 
ccgactctac 



aaatgaacca 
taatcagcgg 
acctgacgaa 
actctgccac 
atggcgatga 
tcctgcttgg 
aagctatcga 
aagagcttga 
gccacgacac 
actaa 



gtatctgaac 
gatgatatgt 
cgaaggcatg 
cagcgatgcc 
tgtcagcgtt 
tcttggcgtt 
cgatctgcgt 
aatgtcgctt 
ctttactcat 



120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1326 



60 
120 
180 
240 
300 
360 
420 
480 
540 
585 



<210> 114 
<211> 363 
<212> DNA 
<213> E. Coli 



<400 
atgttaaagc 
gtggtttttg 
ggtttcgttg 
tcgactacaa 
gttggatggg 
gccatcagcc 
tga 



> 114 
tatttgcaaa 
gtgtttgtat 
tggctgtgag 
cgatgcgcta 
ctgctgatag 
tggtgtgcgg 



gtacacctct 
ctatgtcgcg 
ctttagcttc 
catgctatat 
atgcgcactt 
tttcgtctat 



<210> 115 
<211> 921 
<212> DNA 



attggtgtgc 
catacaaacc 
ttcgcgaatg 
gttgggttca 
cccccgatga 
tcaaagttca 



tgaacaccct 
aagctcttgc 
caaaattcac 
tggggacact 
taactcttgt 
ttgtctttag 



tatacactgg 
aaacttcgca 
attcaaggca 
gagtgctact 
caccttctcc 
ggatgcgaaa 



60 
120 
180 
240 
300 
360 
363 



-38- 



<213> E. Coli 



<400> 115 

atgaagatat ctcttgtagt tcctgtcttc aatgaagaag aagcgatacc aattttttat 60 

aaaacggtac gtgaattcga agaattgaag tcatatgaag tggaaatcgt tttcataaat 120 

gacggcagca aagacgctac ggagtcaatc attaatgctc tggctgtttc agatcctcta 180 

gttgttccgc tgtcatttac acgcaacttt ggtaaagaac cagcattgtt tgcagggtta 240 

gaccatgcaa ccggggatgc gataatccca attgatgttg acctgcaaga cccgattgag 300 

gttattcctc atcttattga aaaatggcaa gcaggtgctg atatggttct tgctaaaaga 360 

tctgaccgct caactgatgg acgcctgaag cgaaaaacgg ctgagtggtt ctataagctc 420 

cacaataaaa taagcaatcc taaaattgaa gagaatgttg gtgatttcag gctgatgagc 480 

cgtgatgttg tcgaaaatat taaacttatg ccagaacgaa accttttcat gaaaggtatt 540 

ctgagctggg taggaggaaa gacagatatt gttgaatacg tgcgagcgga aagaattgct 600 

ggagatacaa aatttaatgg atggaaactt tggaatttag cacttgaggg tattacaagc 660 

ttttccacat tccctcttcg catctggaca tacatagggt tagtggtagc cagtgtagca 720 

tttatttatg gggcgtggat gattttagat actatcatat ttggaaatgc tgttagggga 780 

tatccttcac tacttgtttc aatactgttt ttaggtggaa ttcagatgat tggaatagga 840 

gtattaggtg aatatattgg acgcacatac attgaaacca aaaaacgccc gaaatacatc 900 

atcaagagag tcaaaaaatg a 92i 

<210> 116 
<211> 1332 
<212> DNA 
<213> E. Coli 



<400> 116 

atgaataaag caataaaagt atcattgtat atatcttttg ttttgattat ttgcgcctta 60 

tctaaaaaca taatgatgtt aaatacatct gatttcggaa gagccattaa gccattaatt 120 

gaagacatac cagcatttac atatgactta cctttattgt ataaattgaa aggtcatatt 180 

gattcaattg atagctatga gtatataagt tcatatagtt atattttgta tacatacgtc 240 

ctgtttatta gcatttttac tgaatatctt gatgctaggg tgttatcgtt atttctaaaa 300 

gtaatatata tttattcatt atatgcgata tttacttcat atataaaaac agaaaggtat 360 

gtaactttat ttacattctt tattttagct tttcttatgt gttcttcatc aacactgtca 420 

atgtttgcat cattctatca agagcaaata gttataattt tccttccatt tttggtgtat 480 

tcattaacat gcaaaaacaa taaatctatg cttttgctat ttttttcgtt gctaataata 540 

tctactgcta aaaatcaatt tatattaacc ccactaatag tgtattcata ttatattttt 600 

tttgatagac acaaactaat tattaaatct gtaatatgcg tggtgtgctt gcttgcgtca 660 

atatttgcaa tatcttattc aaaaggtgtt gttgaattaa ataagtacca tgcaacatac 720 

ttcggtagtt atctttatat gaaaaacaac gggtataaaa tgccatcgta tgttgatgat 780 

aagtgtgttg ggttagatgc ctggggtaat aaattcgaca tatcatttgg cgcaacccca 840 

acagaagttg gaacggaatg tttcgaatct cataaagatg aaacgttttc gaatgcactc 900 

tttttattgg ttagcaaacc aagcaccatc ttcaaacttc catttgatga tggtgtgatg 960 

tctcagtata aagaaaatta tttccatgta tataaaaaac tacacgtaat atatggagaa 1020 

tcaaacatac taacgactat tactaacata aaagacaata tatttaaaaa cattagattt 1080 

atatcattgt tattattttt tattgcttct atttttatta gaaataataa aataaaggca 1140 

tctttatttg tagtatctct ttttggaata tctcaatttt atgtgtcatt tttcggggaa 1200 

ggatatagag atttaagcaa gcatttattt ggaatgtatt tttcgttcga cctttgctta 1260 

tacataacag tcgttttttt aatttataaa ataattcaaa gaaatcaaga caatagcgat 1320 

gtaaagcact aa 1332 

<210> 117 
<211> 249 
<212> DNA 
<213> E. Coli 

<400> 117 

atgggcattc tgtcatggat tatttttggg cttattgccg gtattctggc gaagtggatc 60 
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atgccaggta aagatggagg tggattcttt atgactatcc tgctggggat agtcggtgcc 120 

gtagtcggcg gatggatcag cacgctgttt ggctttggta aagtcgatgg cttcaatttt 180 

ggcagcttcg tggttgccgt tattggtgcg attgtcgtgc tatttatcta caggaagatt 24 0 

aaaagttaa 249 

<210> 118 

<211> 183 

<212> DNA 

<213> E. Coli 



<400> 118 

atgggcaaag caacgtatac cgtgaccgtc accaataaca gcaatggcgt ttctgtcgat 
tatgaaacag agacgccgat gactttgctg gtgccagaag tggcggctga agtgataaaa 
gatctggtga ataccgtacg ttcttatgac acggaaaacg aacatgatgt ttgtggttgg 
taa 



60 
120 
180 
183 



<210> 119 
<211> 360 
<212> DNA 
<213> E. Coli 

<400> 119 
atgcttcaaa tcccacagaa 
actgcacctg ccggaatatt 
ccacgccttt ccgttatgca 
gcagagcctg atcaggtgat 
aagtggcaca acattgaagc 
gctcctgaag tcctgatgga 

<210> 120 

<211> 741 

<212> DNA 

<213> E. Coli 



ttatattcat 
cgaacgtcat 
tggggcggtc 
ccttatcgaa 
catgactgac 
sggtgcgcaa 



acgcgctcaa 
cttgataaag 
aaatatctcg 
gcggggcagt 
gatacttatt 
caacggaaag 



cgcctttctg 
gaacgcgccc 
gctacgctga 
ttgcggtgtt 
tcaacattga 
tcattcataa 



gaataaacaa 
gggggtttac 
tgaacacagt 
ccctccagaa 
cttcttcgtg 
cgggaaatga 



60 
120 
180 
240 
300 
360 



<400 
gtgaagttca 
gctaacgaat 
gtgccagaca 
gccaagaaac 
atcgcgaaaa 
gatggtaaaa 
cagttagaca 
cgttctgtgt 
gcgattgata 
gggccggtat 
atgatgaaag 
cagtttgatg 
aaaacacctg 



> 120 
aagttatcgc 
tgccggatgg 
ttgccactct 
aggcagatga 
aagatatcag 
gtatccttaa 
aactgaattc 
cgctgggcgt 
acgcgattca 
atagcgtgcg 
ccgatgccgc 
atcaggtcga 
cagcacaata 



cctggcggca 
accgcatatt 
tgcgattgaa 
gcgcgtcgca 
ctcagcgaac 
aggctaccgc 
cttgctggat 
ggcgcagccg 
tcaggcgcag 
ctaccatgtt 
gccggtgtcc 
tgtggtcttc 
a 



ttaatgggta 


ttagcgggat 


ggcagcgcag 


60 


gtcacctccg 


gtacggcaag 


cgtggatgcg 


120 


gttaacgtgg 


ccgcgaagga 


tgccgctact 


180 


caatacattt 


ccttccttga 


actcaatcag 


240 


ttacgcaccc 


agccagatta 


tgattatcag 


300 


gctgtgagaa 


cggtggaagt 


cacgctccgt 


360 


ggcgcgctga 


aggcgggtct 


taacgaaatt 


420 


gatgcctata 


aagacaaagc 


gcgtaaggca 


480 


gaactggcga 


acggctttca 


tcgtaaactg 


540 


tccaactatc 


agcccagccc 


aatggtgcgg 


600 


gcccaggaaa 


cttacgagca 


ggccgctatt 


660 


cagttagaac 


ctgtggatca 


acaacccgct 


720 








741 



<210> 121 
<211> 1395 
<212> DNA 
<213> E. Coli 



<400> 121 

gtgttattac tggatgcgtg ctcgcaaatg tgcccgtcat tcagacgatt ccagacagtg 60 
tttcataatt cctccatttt tctcccttat tggctggcta cactagtatc attccgcgaa 120 
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acgtttcagg aagagaaact cttaacgatg aaaggtagtt ataaatcccg ttgggtaatc 180 

gtaatcgtgg tggttatcgc cgccatcgcc gcattctggt tctggcaagg ccgcaatgac 240 

tcccggagtg cagccccagg ggcgacgaaa caagcgcagc aatcgccagc gggtggtcga 300 

cgtggtatgc gttccggccc attagccccg gttcaggcgg cgaccgccgt agaacaggca 360 

gttccgcgtt acctcaccgg gcttggcacc attaccgccg ctaataccgt tacggtgcgc 420 

agccgcgtgg acggccaact gatagcgtta catttccagg aaggccagca ggtcaaagca 480 

ggcgatttac tggcagaaat tgaccccagc cagttcaaag ttgcattagc acaagcccag 540 

ggccaactgg caaaagataa agccacgctt gccaacgccc gccgtgacct ggcgcgttat 600 

caacaactgg caaaaaccaa tctcgtttcc cgccaggagc tggatgccca acaggcgctg 660 

gtcagtgaaa ccgaaggcac cattaaggct gatgaagcaa gcgttgccag cgcgcagctg 720 

caactcgact ggagccggat taccgcacca gtcgatggtc gcgttggtct caagcaggtt 780 

gatgttggta accaaatctc cagtggtgat accaccggga tcgtggtgat cacccagacg 840 

catcctatcg atttagtctt taccctgccg gaaagcgata tcgctaccgt agtgcaggcg 900 

cagaaagccg gaaaaccgct ggtggtagaa gcctgggatc gcaccaactc gaagaaatta 960 

agtgaaggca cgctgttaag tctagataac caaatcgatg ccactaccgg tacgattaaa 1020 

gtgaaagcac gctttaataa tcaggatgat gcgctgtttc ccaatcagtt tgttaacgcg 1080 

cgcatgttag tcgacaccga acaaaacgcc gtagtgatcc caacagccgc cctgcaaatg 1140 

ggcaatgaag gccattttgt ctgggtgctg aatagcgaaa acaaggtcag caaacatctg 1200 

gtgacgccgg gcattcagga cagtcagaaa gtggtgatcc gtgcaggtat ttctgcgggc 12 60 

gatcgcgtgg tgacagacgg cattgatcgc ctgaccgaag gggcgaaagt ggaagtggtg 1320 

gaagcccaga gcgccactac tccggaagag aaagccacca gccgcgaata cgcgaaaaaa 1380 

ggagcacgct cctga 1395 

<210> 122 
<211> 3123 
<212> DNA 
<213> E. Coli 

<400> 122 

atgcaggtgt tacccccgag cagcacaggc ggcccgtcgc gcctgtttat tatgcgtcct 60 

gtggccacca cgctgctgat ggtggcgatc ttactcgccg ggattatcgg ttatcgcgcc 120 

ctgcccgttt cggcgctgcc ggaagtggac tatccgacca ttcaggtggt cacgctctac 180 

ccaggtgcca gcccggatgt catgacctct gccgttaccg cgccgctaga acgccagttc 240 

gggcagatgt ctggcctgaa acagatgtcg tcgcaaagtt ccggcggtgc gtcagttatc 300 

actttgcagt tccagctaac attaccgctc gatgtcgccg agcaggaagt gcaggccgcg 360 

attaacgctg cgaccaactt gttgccgagc gatctgccta acccgccggt ttacagcaaa 420 

gtgaacccgg cagatccgcc gatcatgacg ctcgccgtca cctcaaccgc catgccgatg 480 

acgcaagtgg aagatatggt ggaaacccgc gtcgcgcaga aaatctcgca gatttccggc 540 

gtcggcctgg tgacgctttc cggcggtcag cgtccggctg ttcgcgtcaa acttaacgct 600 

caggcgattg ccgccctcgg cctgaccagc gaaaccgtgc gcaccgccat taccggcgct 660 

aacgttaact cggcaaaagg tagcctcgac ggcccttccc gtgcggtcac gctttccgcg 720 

aacgaccaga tgcaatccgc cgaagagtat cgccagctaa tcatcgccta ccagaacggc 780 

gcgccaattc gtctgggcga tgtcgcaact gtagagcaag gtgcagaaaa cagctggctc 840 

ggcgcgtggg cgaacaaaga acaggccatt gtgatgaatg ttcagcgcca gcccggtgct 900 

aacattatct ccaccgccga cagcattcgg cagatgctgc cacagctcac tgagagtctg 960 

ccgaaatcgg tgaaggtgac agtgctttcc gatcgcacca ccaatatccg cgcatccgtc 1020 

gatgatactc agtttgaatt gatgatggct atcgcgctgg tagtcatgat tatctacctg 1080 

tttttgcgca atattccggc gaccatcatt cccggtgttg ctgtaccgct gtcgttaatc 1140 

ggcactttcg cggttatggt gtttctcgat ttttcaatca ataacctgac actgatggcg 1200 

ttaactatcg ccaccggatt cgtggtcgat gacgccatcg tggtgatcga aaacatttcc 1260 

cgctatatcg aaaaaggcga aaaaccgttg gcggcggcgc tcaagggcgc aggtgaaatc 1320 

ggctttacca ttatctcgct gaccttctca ctgattgcgg tgttgatccc actgctgttt 1380 

atgggcgata tcgtcgggcg actgttccgc gaatttgcta ttaccctggc ggtagcgatt 1440 

ttgatctcag cggtggtgtc gctgaccctg acaccgatga tgtgcgcgcg gatgctcagc 1500 

caggagtcgt tgcgtaaaca gaaccgcttc tcccgtgcct cggaaaaaat gttcgacagg 1560 

ataatcgccg cctatggtcg tggactggcg aaagtgctga atcatccgtg gctgacctta 1620 

agcgtggcac tcagcacgct gctgcttagc gtgctgctgt gggtgttcat tccgaaaggt 1680 
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ttcttcccgg tacaggacaa tggcattatt cagggcactt tgcaggcacc gcaatccagc 1740 

tcctttgcca atatggccca gcgacaacgc caggtcgcgg acgtgatttt gcaggatccg 1800 

gcagtgcaaa gcctgacctc atttgttggc gttgatggca ctaacccgtc gctgaacagt 1860 

gcacgtttac aaatcaacct caaaccgttg gatgaacgtg atgatcgggt gcaaaaagtc 1920 

atcgcccgtc tgcaaacggc ggtagataaa gtgccgggcg tcgatctctt cctgcaacca 1980 

acgcaggatc tgactattga tactcaggtc agccgcaccc agtaccagtt taccttgcag 2040 

gccacgtcac tggatgcgct cagtacctgg gtgccacagt tgatggaaaa actccagcaa 2100 

ctgccacagc tttctgatgt ctccagcgac tggcaggaca aagggctggt ggcgtatgtc 2160 

aatgttgatc gcgacagcgc cagccgtctg gggatcagca tggcggatgt cgataacgcc 2220 

ctgtacaacg cgtttggtca gcggctgatt tccactattt atactcaggc caaccagtat 2280 

cgcgtggtgc tggagcacaa caccgaaaat accccaggcc tcgcggcgct ggataccatt 2340 

cgcctgacca gcagcgacgg cggcgtggtg ccgctaagct caattgccaa aattgagcag 2400 

cgttttgcgc cgctctccat caaccatctg gatcagttcc cggtaacgac catctccttt 24 60 

aacgtgccgg ataactattc gctgggcgat gcggtgcagg cgattatgga caccgaaaag 2520 

acgctgaatc tgccggtgga tatcaccacg cagttccagg gcagcaccct cgccttccag 2580 

tcggcgctgg gcagcactgt ctggctgatt gtcgcggcgg tggtggcgat gtatatcgtg 2640 

ctcggcattc tgtacgagag ctttattcac ccgatcacca ttctctcgac gctacccacc 2700 

gcaggggttg gcgcactgct ggcgttgctg attgctggta gcgaactgga tgtgattgcg 2760 

attatcggca ttattttgct gatcggtatc gtgaagaaga acgccatcat gatgatcgac 2820 

ttcgcgctgg ctgctgagcg cgagcaaggc atgtcgccgc gcgaggcaat ctaccaggct 2880 

tgtctgttgc gttttcgtcc gatcctgatg accactctgg cggctctgct tggcgcgctg 2940 

ccgctgatgt tgagtaccgg ggtcggcgcg gaactgcgtc gtccgttagg tatcggcatg 3000 

gtcggcggtc tgattgtcag ccaggtgctg acgctgttta ccacgccggt gatttatttg 3060 

ctgttcgacc gcctggcatt gtggaccaaa agccgctttg cccgtcatga agaggaggcg 3120 

3123 

<210> 123 
<211> 3078 
<212> DNA 
<213> E. Coli 

<400> 123 

gtgaagtttt ttgccctctt catttaccgc ccggtggcga cgattttact gtcggttgcc 60 

attaccctgt gcggcatact gggcttccgt atgctgccgg tcgccccgct gccgcaggtc 120 

gattttccgg tgattatcgt cagcgcctcg ctgcccggtg cgtcaccaga aacaatggcg 180 

tcttccgttg ccacgccgct ggagcgctca cttgggcgca ttgccggagt cagtgaaatg 240 

acctccagca gttcgctcgg cagcacgcgt attattttgc agtttgattt tgaccgggat 300 

atcaacggcg cagcgcgtga tgtgcaggcg gcgatcaacg ctgcacaaag tttgctgccc 360 

agtgggatgc ccagccgccc gacctatcgc aaagcgaacc cgtcggatgc gccaattatg 420 

atcctcacgc tgacgtccga tacttattcg cagggtgaac tgtacgattt cgcctcgacg 480 

cagctggctc cgacgatttc gcaaatcgac ggtgttggtg atgtcgatgt cggaggcagc 540 

tcactgcccg ccgtacgcgt cgggctgaat ccgcaggcgc tgtttaatca gggcgtgtcg 600 

ctggacgacg tacgcaccgc cgtcagcaat gccaacgtgc gtaaaccgca gggcgcgctg 660 

gaagatggca ctcaccgctg gcagatccag accaatgatg agctaaaaac cgccgctgaa 720 

tatcagccgt tgattattca ctacaacaac ggcggcgcgg ttcgtctggg cgatgtggcg 780 

acggtgaccg actcagtgca ggatgtgcgc aacgccggga tgaccaacgc caaaccggct 840 

attttactga tgatccgcaa actgccggaa gccaatatta tccagacggt tgacagcatc 900 

cgggcaaaat taccggagtt gcaggaaacc attccggcgg cgattgatct gcaaattgcc 960 

caggatcgct cccccaccat tcgcgcctcg ctggaagaag tcgagcaaac gctgattatc 1020 

tcggtggcgc tggtgattct ggtggtgttt ttattcctgc gctcgggtcg cgccactatt 1080 

attcccgccg tttcggtgcc ggtttcgctg attggtacgt ttgcggcgat gtacctgtgc 1140 

ggattcagtc tcaataacct ttcgttaatg gcgctcacca tcgctactgg tttcgtggtg 1200 

gatgacgcca tcgtggtgct ggaaaacatt gcacgtcatc tggaagcggg aatgaaaccg 1260 

ttgcaagccg cactgcaagg tactcgcgaa gtcggtttta cggtgctgtc gatgagtctg 1320 

tcactggtgg cggtgttcct gccgctgctg ttgatgggcg gattgccggg ccgactgtta 1380 

cgcgaatttg ccgtgacgct ttctgtcgcc attggtatat cgttgctggt ttctctgaca 1440 

ttaacgccaa tgatgtgtgg ctggatgctg aaagccagca agccgcgcga gcaaaagcga 1500 
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ctgcgtggtt ttggtcgcat gttggtagcc ctgcaacaag gctacggcaa gtcactaaaa 1560 

tgggtgctca atcatacccg tctggtgggc gtggtgctgc ttggcaccat tgcgctgaat 1620 

atctggctgt atatctcgat cccgaaaacc ttcttcccgg agcaggacac tggcgtgttg 1680 

atgggcggga ttcaggcgga tcagagtatt tcgtttcagg cgatgcgcgg taagttgcag 1740 

gatttcatga aaattatccg tgacgatccg gcagtggata atgtcaccgg ctttacaggc 1800 

ggttcgcgag tgaacagcgg gatgatgttt atcaccctca agccacgcga cgaacgcagc 18 60 

gaaacggcgc agcaaattat cgaccgtctg cgcgtaaaac tggcgaaaga accgggggcg 1920 

aatctgttcc tgatggcggt acaggatatt cgcgttggtg ggcgtcagtc gaacgccagc 1980 

taccagtaca cgttgttatc cgacgacctg gcggcactgc gagaatggga gccgaaaatc 2040 

cgcaaaaaac tggcgacgtt gccggaactg gcggacgtga actccgatca gcaggataac 2100 

ggcgcggaga tgaatctggt ttacgaccgc gacaccatgg cacggctggg aatcgacgta 2160 

caagccgcca acagtctgtt aaataacgcc ttcggtcagc ggcaaatctc gaccatttac 2220 

cagccgatga accagtataa agtggtgatg gaagtggatc cgcgctatac ccaggacatc 2280 

agtgcgctgg aaaaaatgtt cgttatcaat aacgaaggca aagcgatccc gctgtcgtat 2340 

ttcgctaaat ggcaaccggc gaatgcccca ctatcggtga atcatcaggg attatcggcg 2400 

gcctcgacca tttcgtttaa cctgccgacc ggaaaatcgc tctcggacgc cagtgcggcg 24 60 

atcgatcgcg caatgaccca gcttggtgtg ccttcgacgg tgcgcggcag ttttgccggc 2520 

acggcgcagg tgttccagga gacgatgaac tcgcaggtga tcctgattat cgccgccatc 2580 

gccacggtgt atatcgtgct gggtatcctt tacgagagtt acgtacatcc gctgacgatt 2640 

ctctccaccc tgccctcggc gggcgttgga gcgctgttgg cgctggagct gttcaatgcc 2700 

ccgttcagcc taatcgccct gatagggatc atgctattaa tcggcatcgt gaagaaaaac 27 60 

gccattatga tggtcgattt tgcgcttgaa gcccaacggc acggtaacct gacgccgcag 2820 

gaagctattt tccaggcctg tctgctgcgt tttcgcccga ttatgatgac taccctggcg 2880 

gcgctgtttg gtgcgctgcc gctggtattg tcgggcggcg acggctcgga gctgcggcaa 2 940 

cccctgggga tcaccattgt cggcggactg gtaatgagcc agctccttac gctgtatacc 3000 

acgccggtgg tgtatctctt tttcgaccgt ctgcggctgc gtttttcgcg taaacctaaa 3060 

caaacggtaa ccgagtaa 3078 

<210> 124 
<211> 1416 
<212> DNA 
<213> E. Coli 

<400> 124 

atgacagatc ttcccgacag cacccgttgg caattgtgga ttgtggcttt cggcttcttt 60 

atgcagtcgc tggacaccac catcgtaaac accgcccttc cctcaatggc gcaaagcctc 120 

ggggaaagtc cgttgcatat gcacatggtc attgtctctt atgtgctgac cgtggcggtg 180 

atgctgcccg ccagcggctg gctggcggac aaagtcggcg tgcgcaatat tttctttacc 240 

gccatcgtgc tgtttactct cggttcactg ttttgcgcgc tttccggcac gctgaacgaa 300 

ctgttgctgg cacgcgcgtt acagggcgtt ggcggcgcga tgatggtgcc ggtcggcaga 360 

ttgacggtga tgaaaatcgt accgcgcgag caatatatgg cggcgatgac ctttgtcacg 420 

ttacccggtc aggtcggtcc gctgctcggt ccggcgctcg gcggtctgct ggtggagtac 480 

gcatcgtggc actggatctt tttgatcaac attccggtgg ggattatcgg tgcgatcgcc 540 

acattgctgt taatgccgaa ctacaccatg cagacgcggc gctttgatct ctccggattt 600 

ttattgctgg cggttggcat ggcggtatta accctggcgc tggacggcag taaaggtaca 660 

ggtttatcgc cgctgacgat tgcaggcctg gtcgcagttg gcgtggtggc actggtgctt 720 

tatctgctgc acgccagaaa taacaaccgt gccctgttca gtctgaaact gttccgtact 780 

cgtacctttt cgctgggcct ggcggggagc tttgccggac gtattggcag tggcatgttg 840 

ccctttatga caccggtttt cctgcaaatt ggcctcggtt tctcgccgtt tcatgccgga 900 

ctgatgatga tcccgatggt gcttggcagc atgggaatga agcgaattgt ggtacaggtg 960 

gtgaatcgct ttggttatcg tcgggtactg gtagcgacca cgctgggtct gtcgctggtc 1020 

accctgttgt ttatgactac cgccctgctg ggctggtact acgttttgcc gttcgtcctg 1080 

tttttacaag ggatggtcaa ctcgacgcgt ttctcctcca tgaacaccct gacgctgaaa 1140 

gatctcccgg acaatctggc gagcagcggc aacagcctgc tgtcgatgat tatgcaattg 1200 

tcgatgagta tcggcgtcac tatcgccggg ctgttgctgg gactttttgg ttcacagcat 1260 

gtcagcgtcg acagcggcac cacacaaacc gtctttatgt acacctggct tagcatggcg 1320 

ttgatcatcg cccttccggc gttcatcttt gccagagtgc cgaacgatac gcatcaaaat 1380 
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gtagctattt cgcggcgaaa aaggagcgcg caatga 



1416 



<210> 125 
<211> 1035 
<212> DNA 
<213> E. Coli 

<400> 125 

atggaaattc gcataatgct atttatatta atgatgatgg ttatgcctgt gagctatgcg 60 

gcatgttata gtgagttatc tgttcagcac aacttggttg ttcaggggga ttttgcactt 120 

actcaaacac aaatggcgac atatgagcat aattttaatg attcgtcatg cgtaagtaca 180 

aatactatca cccctatgag cccgtcggat attattgttg gactttataa cgataccata 240 

aaattaaatt tacattttga atggaccaat aaaaacaaca tcacgttgtc aaataatcag 300 

accagtttca ccagtggtta ttcagttacg gtgacacctg cggccagtaa tgcaaaagtg 360 

aatgtttctg cggggggcgg cggttcagtg atgattaatg gtgttgcgac attatccagt 420 

gcttcatcat cgacacgcgg gagtgccgca gtacaatttc tactgtgttt attaggtggc 480 

aagtcatggg atgcatgtgt aaatagctac agaaatgcat tggcacaaaa tgcaggtgtc 540 

tattccttta atctgacatt gtcatacaac ccgataacca caacctgcaa accggacgat 600 

ttattaatta ctttagacag tattcccgtt tcacaattac cagccacagg taacaaagca 660 

acaataaata gtaaacaagg ggatattatt ctgcgttgta aaaatttatt aggtcaacaa 720 

aatcaaacat cacggaaaat gcaggtgtat ttatcaagtt ctgacttgtt aaccaacagc 780 

aacacaatac tgaaaggtgc ggaagataat ggcgtaggat ttattcttga aagtaatggt 840 

tcgccagtca cacttttaaa tatcactaac agcagtaaag gatatacaaa tttaaaggaa 900 

gttgcggcga agtcaaaact tacagataca acggtttcaa ttccgataac agccagttac 960 

tacgtctacg atacaaacaa agttaaatct ggcgcactgg aggcaaccgc attaatcaac 1020 

gtgaaatacg actaa 1035 

<210> 126 
<211> 2481 
<212> DNA 
<213> E. Coli 

<400> 126 

atgttgagaa tgaccccact tgcatcagca atcgtagcgt tattgctcgg cattgaagct 60 

tatgcagctg aagaaacctt tgatacccat tttatgatag gtggaatgaa agaccagcag 120 

gttgcaaata ttcgtcttga tgataatcaa cccttaccgg ggcagtatga catcgatatt 180 

tatgtcaata agcaatggcg cgggaaatat gagattattg ttaaagacaa cccgcaagaa 240 

acatgtttat caagagaagt tatcaagcgg ttaggcatta atagcgataa cttcgccagc 300 

ggtaagcaat gtttaacatt tgagcaactt gttcagggtg ggagctatac ctgggatatc 360 

ggggtttttc gtctcgattt cagtgtcccg caggcctggg tggaagaact ggaaagtggc 420 

tatgttccac cggaaaactg ggagcggggt attaatgcgt tttatacctc ttattatctg 480 

agtcagtatt acagcgacta taaagcgtcg ggtaataaca agagtacata tgtacgtttt 540 

aacagcgggt taaatttact ggggtggcaa ctgcattctg atgccagttt cagtaaaaca 600 

aataacaatc caggggtgtg gaaaagcaat accctgtatc tggaacgtgg atttgcccaa 660 

cttctcggca cgcttcgcgt gggtgatatg tacacatcaa gcgatatttt tgattctgtt 720 

cgcttcagag gtgtgcggtt gtttcgtgat atgcagatgt tgcctaactc gaaacaaaat 780 

tttacgccac gggtgcaggg gattgctcag agtaacgcgc tggtaactat tgaacagaat 840 

ggttttgtgg tttatcagaa agaggttcct cctggcccgt tcgcgattac agatttgcag 900 

ttggccggtg gtggagcaga tcttgatgtc agcgtgaaag aggcggacgg ctcggtaacc 960 

acctatctgg tgccttatgc agcggtgcca aatatgctgc aacccggcgt gtcgaaatat 1020 

gatttagcgg cgggtcgtag ccatattgaa ggggcgagca aacaaagtga ttttgtccag 1080 

gcgggttatc agtatggttt taataattta ttgacgctgt atggtggctc gatggtcgcg 1140 

aataattatt acgcgtttac tttgggggct ggctggaata cacgcattgg tgccatttcc 1200 

gtcgatgcca ctaagtcgca tagtaaacaa gacaacggcg atgtgtttga cgggcaaagt 1260 

tatcaaattg cctacaacaa atttgtgagc caaacgtcga cgcgttttgg tctggcggcc 1320 

tggcgttatt cgtcgcgtga ttaccggaca tttaacgatc acgtttgggc aaacaataaa 1380 

gataattatc gccgtgatga aaacgatgtc tatgacattg ccgattatta ccagaacgat 1440 
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tttggccgca aaaatagctt ttccgccaat atgagccagt cattgccaga aggttggggg 1500 

tctgtgtcat taagtacgtt atggcgagat tactgggggc gtagcggcag tagtaaggat 1560 

tatcagttga gttattccaa caacctgcga cggataagct ataccctcgc ggcaagccag 1620 

gcttatgacg agaatcatca tgaagagaaa cgttttaata tttttatatc gattcccttt 1680 

gattggggtg atgacgtttc gacgcctcgt cggcaaatat atatgtctaa ctcaacgacg 1740 

tttgatgatc aggggtttgc ctcaaataat acgggattat caggaacagt agggagtcgg 1800 

gatcagttca attatggtgt caacctgagt catcaacatc agggaaatga aacgacagct 1860 

ggggcgaatt tgacctggaa cgcgccggtt gcgacagtga atggcagtta tagtcagtcg 1920 

agtacttatc gacaggctgg agccagtgtt tcagggggca ttgtcgcctg gtcgggtggc 1980 

gttaatctgg cgaaccgtct ttccgaaacg tttgctgtga tgaatgcgcc aggaattaaa 204 0 

gatgcttatg tcaatgggca aaaatatcgc acaacaaacc gtaatggagt ggtgatatac 2100 

gacggaatga caccttatcg ggaaaatcac ctgatgctgg atgtgtcgca aagcgatagc 2160 

gaagcagaat tacgtggcaa ccggaaaatt gccgcccctt atcgcggcgc ggttgtactg 2220 

gttaattttg ataccgatca gcgcaagcca tggtttataa aagcgttaag agcagatggg 2280 

caatcattaa cgtttggtta tgaagtcaat gatatccatg gtcataatat tggcgttgtc 2340 

ggccagggaa gtcagttatt tattcgcacc aatgaagtac cgccatcggt taatgtggca 2400 

attgataagc aacaaggact ttcatgcaca atcaccttcg gtaaagagat tgatgaaagt 24 60 

agaaattata tttgccagta a 2481 

<210> 127 
<211> 720 
<212> DNA 
<213> E. Coli 

<400> 127 

atggccgcta tcccatggcg gccttttaat ttaagaggca ttaaaatgaa aggattatta 60 

tctttactca ttttttctat ggtccttcct gcacatgccg gaattgttat ctacgggacg 120 

cgcattattt acccggcaga aaataaagaa gtgatggtgc agttgatgaa ccagggaaac 180 

cgttcttcgc tgctgcaggc gtggattgat gatggcgata cgtcattacc accagaaaaa 240 

attcaggttc ctttcatgtt aacgccacca gtggcaaaaa taggggcaaa ttccgggcag 300 

caagtaaaaa tcaaaattat gccgaataaa ctgcccacta ataaagaaag cattttttat 360 

ctgaatgttc tggatattcc accaaatagt ccagagcaag aaggtaagaa tgcactgaag 420 

tttgcgatgc aaaacagaat taagttgttt taccggccag cgggtattgc tccggtaaat 480 

aaagcgacat ttaaaaaatt gctggtaaat cgcagtggca atggtttggt gataaaaaat 540 

gactcagcta attgggtgac gatttcggat gtcaaagcta ataatgtcaa agtcaattat 600 

gaaactatta tgattgcccc cttagaaagt cagagtgtta atgtcaaaag taataatgca 660 

aataactggc atctgaccat tatcgatgac catggcaact atattagtga caaaatttaa 720 

<210> 128 
<211> 543 
<212> DNA 
<213> E. Coli 

<400> 128 

atgaaacgtt caattattgc tgccgctgtc ttttcttctt tttttatgag cgctggagta 60 

tttgctgcag acgttgatac cggaacatta actattaagg ggaatattgc agaatctccg 120 

tgtaaattcg aagcgggtgg tgattcagta agtattaata tgccgactgt accaaccagt 180 

gtctttgaag gtaaagctaa atattctacc tatgatgatg cagtcggtgt aaccagcagc 240 

atgttaaaaa ttagctgccc gaaagaagtt gctggtgtaa aactctcgtt gattaccaac 300 

gataaaataa ccggtaacga taaggcgata gccagtagca acgataccgt gggttactat 360 

ctctatttag gtgataacag cgatgtcctg gatgtttctg caccttttaa cattgagagt 420 

tataaaacag cggaaggtca atatgctatt ccgtttaaag caaaatacct gaaactgaca 480 

gataactcag tgcaatcagg tgatgtgtta tcttctctgg ttatgcgtgt ggcgcaggat 540 
taa 

<210> 129 
<211> 339 
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<212> DNA 
<213> E. Coli 



<400> 129 
atgagttcag agcgagatct 
gcagttatag ccggtggtgt 
agctttggct ttccagtaat 
acagttgttt taaatgaaat 
attagagatg gactaaaacg 
ccatttatgt atgttcttaa 

<210> 130 
<211> 582 
<212> DNA 
<213> E. Coli 

<400> 130 

atgactgact acctgttact gtttgtcgga actgtactgg tcaataactt tgtactggtc 60 

aagtttctcg gtctctgtcc gtttatgggg gtttccaaaa aactggaaac cgcgatgggc 120 

atggggctgg caacaacgtt tgtgatgacg ctggcgtcta tttgcgcctg gcttatcgat 180 

acgtggattt tgatcccact taatctgatt tacctgcgca ccctggcatt tattctggtg 240 

attgctgtgg tcgtgcagtt caccgagatg gtggtgcgca aaaccagccc ggtgctttac 300 

cgtctgctgg ggattttttt gccgcttatc accaccaact gtgcagtgct cggcgtggcg 360 

ttgctgaata tcaatctcgg gcacaatttc ttgcagtcgg cgctgtacgg tttttccgcc 420 

gctgtcggtt tctcgctggt gatggtgctc ttcgccgcca tccgcgaacg ccttgctgtg 480 

gctgatgtcc cggcaccttt tcgcggtaat gccattgcgt taattaccgc aggtcttatg 54 0 

tctctggcct ttatgggctt tagtggtttg gtgaagttgt aa 582 

<210> 131 

<211> 579 

<212> DNA 

<213> E. Coli 

<400> 131 

atgaatgcta tctggattgc cgttgccgcc gtgagcctgc tgggcctggc gtttggcgcc 60 

attctgggtt atgcctcccg ccgttttgcg gtggaagacg atccggtcgt tgagaaaatt 120 

gacgaaatct taccgcagag ccagtgtggt cagtgcggtt atcccggctg tcgcccctac 180 

gcggaagcca tcagctgtaa cggtgaaaaa atcaaccgtt gcgccccagg tggcgaagct 240 

gtgatgctaa aaattgccga gttgcttaat gtcgagccgc agccgctgga tggcgaagcg 300 

caagagataa cgcctgcgcg gatggtggcg gttattgatg aaaataactg tattggctgc 360 

actaaatgta ttcaggcgtg tccggtagac gccatcgttg gcgctacccg tgccatgcat 420 

acggtaatga gtgatctctg tacgggctgc aatttatgtg ttgatccgtg cccgacgcac 480 

tgcatctcgt tgcaaccggt cgcagaaaca cctgactcct ggaaatggga tctgaacacc 540 

attcccgtgc gtatcattcc cgtggaacac catgcttaa 579 

<210> 132 
<211> 2223 
<212> DNA 
<213> E. Coli 

<400> 132 

atgcttaagt tattctctgc attcagaaaa aataaaatct gggatttcaa cggcggcatc 60 

catccaccgg agatgaaaac ccagtccaac ggtacacccc tgcgccaggt acccctggcg 120 

cagcgttttg ttattccact gaaacagcat attggcgctg aaggtgagtt gtgcgttagc 180 

gtcggcgata aagtattgcg cggccagccg cttacccgtg gtcgcggcaa aatgctgcct 240 

gttcacgcgc ccacctcggg taccgttacg gctattgcgc cccactctac ggctcatcct 300 

tcagctttag ctgaattaag cgtgattatt gatgccgatg gtgaagactg ctggatcccg 360 



ggttaatttt cttggcgatt tttcaatgga tgtggccaaa 60 

tgcaaccgct attggaagtc tggcttcttt tgcctgtgtt 120 

tcttgtcgga ggagcaattt tactgacagg gatagtgtgt 180 

cgatgctcaa tgccatttat cagaaaaatt aaaatatgca 24 0 

gcaacaggaa cttgataaat ggaaaaggga aaacatgact 300 

cactccaccc gtgatatga 339 



-46- 



cgcgacggct gggccgatta tcgcactcgc agtcgcgaag agttaatcga gcgcatacat 420 

cagtttggtg ttgccgggct gggcggtgca ggattcccga caggcgttaa attgcagggt 480 

ggcggagata agattgaaac gttgattatc aacgcggctg agtgcgagcc gtacattacc 540 

gccgatgacc gtttgatgca ggattgcgcg gctcaggtcg tagagggtat tcgcattctt 600 

gcgcatattc tgcagccacg cgaaattctt atcggcattg aagataacaa accgcaggcg 660 

atttccatgc tgcgcgcggt gctggcggac tctaacgata tttctctgcg ggtgattcca 720 

accaaatatc cttctggcgg tgctaaacaa ttaacctaca ttctgaccgg gaagcaggtt 780 

ccacatggcg ggcgttcatc cgatatcggc gtattaatgc aaaacgtcgg cactgcttat 840 

gcagtgaaac gtgccgttat tgatggcgag ccgattaccg agcgtgttgt aaccctgact 900 

ggcgaagcaa tcgctcgccc gggcaacgtc tgggcacggc tggggacgcc agtgcgtcat 960 

ttattgaatg atgccggatt ctgcccctct gccgatcaaa tggtgattat gggtggcccg 1020 

ctaatgggct ttaccttgcc atggctggat gtcccggtcg taaagattac caactgtctg 1080 

ttggctccct ctgccaatga acttggcgaa ccacaggaag aacaaagctg catccggtgt 1140 

agcgcctgtg ctgacgcctg ccctgcggat cttttgccgc aacagttgta ctggttcagc 1200 

aaaggtcagc aacacgataa agctaccacg cataacattg ctgattgcat tgaatgtggg 1260 

gcttgcgcgt gggtttgccc gagcaatatt cccctggtgc aatatttccg tcaggaaaaa 1320 

gctgaaattg cggctattcg tcaggaagaa aagcgcgccg cagaagccaa agcgcgtttc 1380 

gaagcgcgcc aggctcgtct ggagcgcgaa aaagcggctc gccttgaacg acataagagc 1440 

gcagccgttc aacctgcagc caaagataaa gatgcgattg ctgccgctct ggcgcgggtg 1500 

aaagagaaac aggcccaggc tacacagcct attgtgatta aagcgggcga acgcccggat 1560 

aacagtgcaa ttattgcagc acgggaagcc cgtaaagcgc aagccagagc gaaacaggca 1620 

gaactgcagc aaactaacga cgcagcaacc gttgctgatc cacgtaaaac tgccgttgaa 1680 

gcagctatcg cccgcgccaa agcgcgcaag ctggaacagc aacaggctaa tgcggaacca 1740 

gaacaacagg tcgatccgcg caaagccgcc gtcgaagccg ctattgcccg tgccaaagcg 1800 

cgcaagctgg aacagcaaca ggctaatgcg gaaccagaag aacaggtcga tccgcgcaaa 18 60 

gccgccgtcg aagccgctat tgcccgtgcc aaagcacgca agctggaaca gcaacaggct 1920 

aatgccgagc cagaacaaca ggtcgatccg cgcaaagccg ccgtcgaagc cgctattgcc 1980 

cgagccaaag cgcgcaaacg ggaacagcaa ccggctaatg cggagccaga agaacaggtt 2040 

gatccgcgca aagctgccgt cgaagcggct attgcacgcg ccaaagcacg caagctggaa 2100 

cagcaacagg ctaatgcggt accagaagaa caggttgatc cgcgcaaagc ggcagttgcc 2160 

gcggctattg cccgcgctca ggccaaaaaa gccgcccagc agaaggttgt aaacgaggac 2220 

2223 

<210> 133 
<211> 1059 
<212> DNA 
<213> E. Coli 

<400> 133 

atggtattca gaatagctag ctccccttat acccataacc agcgccagac atcgcgcatt 60 

atgctgttgg tgttgctcgc agccgtgcca ggaatcgcag cgcaactgtg gttttttggt 120 

tggggtactc tcgttcagat cctgttggca tcggttagtg ctctgttagc cgaagctctc 180 

gtactcaaac tacgcaagca gtcggtagcc gcaacgttga aagataactc agcattgctg 240 

acaggcttat tgctggcggt aagtattccc cccctcgcgc catggtggat ggtcgtgctg 300 

ggtacggtgt ttgcggtgat catcgctaaa cagttgtatg gcggtctggg acaaaacccg 360 

tttaatccgg caatgattgg ttatgtggtc ttactgatct ccttccccgt gcagatgacc 420 

agctggttac cgccacatga aattgcggtc aacatccctg gttttatcga cgccatccag 480 

gttattttta gcggtcatac cgccagtggt ggtgatatga acacactacg cttaggtatt 540 

gatggcatta gtcaggcgac accgctggat acatttaaaa cctctgtccg tgccggtcat 600 

tcggttgaac agattatgca atatccgatc tacagcggta ttctggcggg cgctggttgg 660 

caatgggtaa atctcgcctg gctggctggc ggcgtatggt tgctatggca gaaagcgatt 720 

cgctggcata ttcccctcag cttcttagta acgctggcgt tatgcgcaat gttgggctgg 780 

ttgttctcac cagaaacact ggcagcaccg caaattcatc tgctgtctgg agcgaccatg 840 

ctcggcgcat tctttatttt gactgacccg gttaccgctt ctacgaccaa tcgtggtcgt 900 

cttattttcg gcgcgcttgc gggcttatta gtctggttga tccgcagttt cggcggctat 960 

cctgacggcg tggcttttgc cgtcctgctg gcgaacatca cggttcctct gatcgattac 1020 

tacacgcgtc cgcgcgtcta cggccatcgc aaagggtaa 1059 
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<210> 134 
<211> 621 
<212> DNA 
<213> E, Coli 

<400> 134 

atgctgaaaa ctatccgaaa acacggcatt acgttggcgc tatttgcagc gggttcaaca 60 

gggttaactg cggccatcaa ccagatgacc aaaacgacga ttgctgaaca ggccagtctg 120 

caacaaaagg cgttatttga tcaggtgctg ccagccgaac gctataacaa tgcgctggca 180 

cagagttgct atctggtaac tgcgccagag ttaggtaaag gtgagcatcg ggtttacatc 240 

gccaaacagg atgacaaacc ggtagccgcc gttctggaag caaccgcgcc agatggctat 300 

tccggtgcga ttcagctgct ggtgggagcc gattttaacg gcacggtact tggcacgcgc 360 

gtgacagagc accacgaaac gccagggctt ggcgataaaa tcgaactgcg cctttctgac 420 

tggatcaccc attttgcggg taaaaaaatc agtggtgcag atgatgcgca ctgggcggtg 480 

aagaaagatg gtggtgattt cgaccagttc accggcgcga cgattactcc ccgcgcggtg 540 

gttaatgcgg taaaacgcgc cggattgtac gctcagacgt taccggcaca actttctcaa 600 

cttcctgcct gtggagaata a 521 

<210> 135 
<211> 696 
<212> DNA 
<213> E. Coli 

<400> 135 

gtgagcgaaa ttaaagacgt tattgttcag gggttgtgga aaaacaactc tgcgctggtc 60 

cagttgctcg gcctttgtcc tctgttggcg gtcacgtcca ctgccactaa cgctctgggt 120 

ttaggacttg cgactacgct ggtactgacg ctgaccaacc tgaccatttc gacgctgcgt 180 

cactggacgc cagccgagat ccgcattccc atttacgtga tgatcatcgc ctcggtggtc 240 

agcgctgtac agatgctgat caacgcctac gcctttggcc tgtatcaatc attagggatt 300 

tttattccgc tgattgtcac taactgtatc gttgtgggcc gcgctgaagc cttcgccgcc 360 

aaaaaaggtc cggcgctttc ggcactggac ggcttttcaa ttggtatggg cgcaacctgc 420 

gccatgttcg tgctgggttc actacgcgaa attatcggca atggcacatt gtttgacggt 4 80 

gcagatgcgc tgttaggtag ctgggcaaaa gtattacgcg tggagatttt ccacaccgac 540 

tcccctttcc tgctggcgat gctgccacca ggtgcattta ttggcctggg actgatgctg 600 

gcaggaaaat acctgattga tgaaagaatg aaaaagcgcc gtgctgaagc agctgcagaa 660 

cgtgcattgc caaacggtga aacagggaat gtctga 695 

<210> 136 
<211> 636 
<212> DNA 
<213> E. Coli 

<400> 136 

atgaataaag caaaacgcct ggagatcctc actcgcctgc gtgagaacaa tcctcatccc 60 

accaccgagc ttaatttcag ttcgcctttt gaattgctga ttgccgtact gctttccgct 120 

caggcgaccg atgtcagtgt taataaggcg acggcgaaac tctacccggt ggcgaatacg 180 

cctgcagcga tgcttgaact gggcgttgaa ggggtgaaaa cctatatcaa aacgattggg 240 

ctttataaca gcaaagcaga aaatatcatc aaaacctgcc gtatcttgct ggagcagcat 300 

aatggcgagg ttccggaaga tcgtgctgcg cttgaagccc tgcccggcgt aggtcgtaaa 360 

acagccaacg tcgtattaaa cactgcattc ggctggccga ctattgctgt cgacacgcac 420 

attttccgcg tttgtaatcg tactcaattt gcgccgggga aaaacgtcga acaggtagaa 480 

gaaaagctac tgaaagtggt tccagcagag tttaaagtcg actgccacca ttggttgatc 540 

ctgcacgggc gttatacctg cattgcccgc aagccccgct gtggctcttg tattattgaa 600 

gatctttgtg aatacaaaga gaaagttgac atctga 635 

<210> 137 
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<211> 504 
<212> DNA 
<213> E. Coli 



<400> 137 
atgaaaagac ttcacaagag gttcctgtta 
ctccaggccg ccgatgtcac tatcactgtt 
attcaaacca aagaagctaa cgttaatctc 
cctggttctg catctggctg gcacaatatt 
acaagtgcag tgacggcaat cgtgacaggt 
gaaggtactg ccgaaaatat tcagatagag 
aatggcgata gcaaaacggt tattgttgat 
aaggcaagag ctatcacggt gaatggaaac 
aatgtgatct acacctggca ataa 

<210> 138 

<211> 531 

<212> DNA 

<213> E. Coli 

<400> 138 

atgaaataca ataacattat tttcctcggt ttatgtctgg ggttaaccac ctattctgct 60 

ttatccgcag atagcgttat taaaattagc gggcgcgtcc tcgattatgg ctgcacagtc 120 

tcatcggatt cgcttaattt taccgtagat ctccaaaaaa acagtgccag acaatttcca 180 

acgaccggta gcacaagtcc agccgtccct tttcagatta cgttaagtga atgcagcaaa 240 

gggacaacgg gggttcgggt tgcatttaac ggtattgagg atgcagaaaa taatactttg 300 

ttgaaactgg atgaaggaag caatacggcc tccggtttgg gtatagaaat attggacgca 360 

aatatgcgtc cggtgaaact gaatgatctt catgccggga tgcagtggat cccactggta 420 

ccagaacaga acaatatttt gccttactcc gctcgtctga agtcaactca gaagtccgtc 480 

aatccgggac tggtgagggc ttcggcaacc tttacccttg aatttcaata a 531 

<210> 139 
<211> 1149 
<212> DNA 
<213> E. Coli 

<400> 139 

atgagtggtt acaccgtcaa gcctcctacc ggagacacca atgagcagac acaatttatt 60 

gattatttta atctgttcta cagtaagcgt ggtcaggaac aaataagcat ctctcagcag 120 

cttggaaatt acggtacgac atttttcagt gccagtcgcc aaagttactg gaacacgtca 180 

cgcagcgacc agcaaatatc atttggatta aatgtgccgt ttggtgatat tacgacttcg 240 

ctgaattaca gctattccaa taatatatgg caaaacgatc gggatcattt actcgctttt 300 

acgcttaatg ttcccttcag tcattggatg cgtacagaca gtcagtcggc atttcgtaat 360 

tcaaacgcca gttacagtat gtcaaacgat ttgaaaggcg gcatgaccaa tctatcgggg 420 

gtttatggca ctctgctgcc ggataataac ctgaattata gcgttcaggt cggtaacacc 480 

cacggaggta atacatcgtc tggcaccagt ggttacagtt ctcttaatta tcgtggagct 540 

tatggtaata ctaatgtcgg ttacagtcgg agtggtgaca gcagccagat ttattacgga 600 

atgagtggtg ggattattgc tcatgctgat ggcatcacct ttggacagcc gctgggcgac 660 

acaatggttc tggttaaggc tcctggtgct gataatgtca aaatagagaa ccagaccgga 720 

attcataccg actggcgtgg ctatgccata ttaccatttg cgacagaata tagagaaaac 780 

cgtgttgctc ttaacgcgaa ttcccttgca gataatgttg aactggatga aaccgtggtc 840 

actgtcatcc caactcacgg tgctattgcc agagcaacat ttaatgcaca aatcggcggg 900 

aaagtattaa tgacgttgaa gtacggtaat aagagcgttc cattcggtgc aattgtcaca 960 

cacggagaga ataaaaatgg cagcattgtc gcggaaaatg gtcaggttta tctgactgga 1020 

cttccacagt cagggcaatt acaggtttca tggggcaaag ataaaaactc aaactgtatt 1080 

gtcgagtaca agcttcctga agtttctcct ggtaccttac tgaaccagca gacagcaatc 1140 

tgtcgctaa -,149 



gctacgtttt gcgcgttatt 
aatggtcggg tagtcgctaa 
ggggatcttt atacgcgcaa 
actttgtcat taaccgattg 
tcaactgaca atacgggtta 
ctgagggatg accaggatgc 
gagatcactc gtaatgcaca 
gcaagccagg gaacgatcga 



cacagcaact 60 

accctgcact 120 

tctgcaacaa 180 

tccggttgaa 240 

ttacaaaaat 300 

tgcgttaaaa 360 

gtttccactt 420 

ggcgctaatc 480 
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<210> 140 
<211> 417 
<212> DNA 
<213> E. Coli 



<400 
atgattgcga 
ctggcgggta 
ctgctcaacg 
gaagaatcag 
atagccgtga 
attggcagta 
cacgctaaaa 



> 140 
ttgccgacat 
ttcagaacga 
atcctgaaaa 
cgcccgaatt 
ttcgtaccct 
aatctatggt 
aattggcaac 



<210> 141 
<211> 315 
<212> DNA 
<213> E. Coli 



cttgcaagca 
ggaacaatac 
ccccttgctg 
tgcggaattt 
tatggatcaa 
gtcacgcgtt 
gcgattcggc 



ggagaaaagc 
acccaggcgc 
gatctggtgt 
aatgccatgg 
tatggtttaa 
ttgagcggga 
atttctcccg 



taactgctgt 
tggaactggt 
gtgccaaaat 
ctcaagccat 
ccctttccga 
agaggaaatt 
ccttgtttat 



ggcacctttt 
agatcatctg 
aaccgcgtgg 
gcctggcggt 
tctgccggaa 
aacgctggaa 
tgattaa 



60 
120 
180 
240 
300 
360 
417 



<400> 141 

atgcacctga taactcaaaa agcattgaaa gatgctgcgg aaaaataccc gcaacataaa 60 

acggagttgg tggctctggg gaacacgatt gctaagggat atttcaaaaa acctgagtca 120 

ttaaaagcag tattcccatc tctggataac ttcaaatatc tggataagca ttatgttttc 180 

aatgttgggg gcaatgaatt acgtgttgta gcaatggtct tttttgaatc gcaaaagtgc 240 

tacatacgtg aagttatgac gcataaagaa tacgatttct ttaccgctgt tcatcgtact 300 

aaggggaaaa aatga 3^5 

<210> 142 
<211> 7152 
<212> DNA 
<213> E. Coli 



<400> 142 

ttgctatcag tatttacatt ttttcgctgt gctagaaagg gcgcatttat gttagctcgt 60 

tcagggaagg taagcatggc tacgaagaag agaagtggag aagaaataaa tgaccgacaa 120 

atattatgcg ggatgggaat taaactacgc cgcttaactg cgggtatctg tctgataact 180 

caacttgcgt tccctatggc tgcggcagca caaggtgtgg taaacgccgc aacccaacaa 240 

ccagttcctg cacaaattgc cattgcaaat gccaatacgg tgccctacac ccttggagcg 300 

ttggaatcgg cccaaagcgt tgccgaacgt ttcggtattt cggtggctga gttacgcaaa 360 

ctcaaccagt ttcgtacgtt tgctcgaagt tttgataatg tccgccaggg tgatgaactg 420 

gatgtcccgg cacaagttag tgaaaaaaaa ttaaccccgc cgccgggtaa tagcagtgac 480 

aacctcgagc aacagatagc cagtacttca cagcaaatcg ggtctctgct cgccgaagat 540 

atgaacagcg agcaagcggc aaatatggcg cgtggatggg cctcttctca ggcttcaggc 600 

gcaatgacag actggttaag ccgcttcggt accgcaagaa tcacgctggg cgtggatgaa 660 

gattttagcc tgaagaactc ccagttcgat tttctccatc cgtggtatga aacgcctgat 720 

aatctctttt tcagtcagca tactctccat cgtactgacg agcgtacgca gattaacaac 780 

ggcttaggtt ggcgtcattt cactcccaca tggatgtcgg gcatcaactt ctttttcgac 840 

cacgatctta gccgttacca ctcccgcgcc ggcattggcg cggagtactg gcgcgactat 900 

ctaaaattaa gcagtaacgg ctatttgcga ctgaccaact ggcgcagcgc acctgaactg 960 

gacaacgatt atgaagcacg cccggccaat ggctgggatg tacgcgcaga aagctggcta 1020 

cccgcctggc cgcaccttgg cggtaaactg gtctatgaac agtattatgg cgatgaagtg 1080 

gccctgttcg ataaagacga tcggcaaagt aatcctcatg ccataaccgc tggacttaac 1140 

tataccccct tcccgctgat gaccttcagc gcggagcaac gccagggtaa acagggcgaa 1200 

aatgacaccc gttttgccgt cgattttacc tggcaacctg gcagcgcaat gcagaaacag 1260 

cttgacccga atgaagtcgc tgcacggcgt agccttgcag gcagccgtta tgatctggtg 1320 

gatcgcaaca acaatatcgt tctggaatat cgcaaaaaag aactggttcg cctgaccctg 1380 



-50- 



acagaccccg tgacagggaa gtcaggagaa gtgaaatcac tggtttcgtc gctacaaacc 14 4 0 

aaatatgccc tgaaaggcta taacgtcgaa gccaccgcac tggaagctgc cggtggcaaa 1500 

gtggtcacaa cgggtaaaga tattctggtt accctgccgg cttaccggtt caccagtacg 15 60 

ccagaaaccg ataacacctg gccgattgaa gtcaccgccg aagatgtcaa aggcaatttg 1620 

tcgaatcgtg aacagagcat ggtggtcgtt caggcaccta cgctaagcca gaaagattcc 168 0 

tcggtatcgt taagtaccca aacattgaac gcggattccc attcaaccgc cacactgact 1740 

tttattgcgc atgatgcagc aggtaatcct gttgtcgggc tggtgctctc gacgcgtcac 1800 

gaaggtgttc aggacatcac cctttctgac tggaaagata atggtgacgg aagctatacc 1860 

cagatcctga ccacaggagc gatgtctggc acgctgacgc tgatgccaca gctgaatggt 1920 

gtggatgcgg ctaaagcccc cgccgtggtg aatatcattt ctgtttcgtc atcccgaact 1980 

cactcgtcaa ttaagattga taaggaccgt tatctctccg gcaatcctat cgaggtgacg 2040 

gtagaactga gagatgaaaa tgacaaacct gttaaggaac aaaaacagca actgaataac 2100 

gcagtcagca tcgacaacgt gaaaccagga gtcactacag actggaaaga aaccgcagat 2160 

ggcgtctata aggcgaccta taccgcctat accaaaggca gtggacttac tgcgaagcta 2220 

ttaatgcaaa actggaatga agatttgcat accgctggat ttatcatcga cgccaacccg 2280 

cagtcagcga aaattgcgac attatctgcc agcaataatg gtgtgctcgc caatgagaat 2340 

gcagcaaaca ccgtctcggt caatgtcgct gatgaaggaa gcaacccaat caatgatcat 2400 

accgtcacgt ttgcggtatt aagcggatcg gcaacttcct tcaacaatca aaacaccgca 24 60 

aaaacggatg ttaatggtct ggcgactttt gatctgaaaa gtagtaagca ggaagacaac 2520 

acggttgaag tcacccttga aaatggcgtg aaacaaacgt taatcgtcag ttttgtcggc 2580 

gactcgagta ctgcgcaggt tgatctgcag aagtcgaaaa atgaagtggt tgctgacggc 2640 

aatgacagcg tcacaatgac cgcgaccgtc cgggatgcaa aaggcaacct gctcaatgac 2700 

gtcatggtca ctttcaatgt taattcagca gaggcgaaac tgagccaaac cgaagtgaat 2760 

agccacgacg ggatcgccac agctacgctg accagtttga aaaatggtga ttatagggtt 2820 

acggcctctg tgagctctgg ttcccaggct aatcaacagg tgaattttat cggtgatcaa 2880 

agtactgctg ccctgaccct cagtgtgcct tcaggtgata tcaccgtcac caacacagct 2940 

ccgcaatata tgactgcaac cttgcaggat aaaaatggca acccactaaa agataaagaa 3000 

atcaccttct ctgtgccaaa cgacgtcgca agtaagttct cgattagcaa cggaggaaaa 3060 

ggcatgacgg atagtaacgg ggttgcaatc gcctccctga ccggcacgtt agcgggcacg 3120 

catatgatca tggctcgtct ggctaacagc aatgtcagcg atgcacagcc aatgacgttt 3180 

gtggcggata aagacagagc ggttgtcgtt ttgcaaacat cgaaagcgga aatcattggg 3240 

aatggcgtgg atgagacaac tctgacagca acagtgaaag atccgtcgaa tcatccggtg 3300 

gcggggataa cggtaaactt caccatgcca caggacgttg cggcaaactt tacccttgaa 3360 

aataacggta ttgccatcac tcaggccaat ggggaagcgc atgtcacgct gaaaggtaaa 3420 

aaagcgggca cgcatacggt taccgcaacg ctgggtaata acaataccag tgattcgcag 3480 

ccggtaacat ttgtggcgga caaagcctcg gctcaggttg tcctgcagat atcaaaagat 3540 

gagatcacag gtaatggcgt cgatagcgca acgctaactg caacggttaa agatcagttc 3600 

gacaatgagg tgaataatct tccggtaaca ttcagctcag cctcttcagg actcaccctg 3660 

accccgggag taagtaatac caacgagtct ggcatcgcgc aggccactct cgcaggcgtt 3720 

gcctttggtg agaagacggt tactgcatca ctggctaata atggtgccag cgacaacaaa 3780 

actgtgcatt ttattggcga cacagcggcg gcaaaaatta tcgagttggc gcctgtccca 3840 

gacagcataa tcgccggtac cccgcagaac agctccggca gcgtcatcac cgccacagtc 3900 

gttgataata atggctttcc ggtgaaaggt gtgactgtga acttcaccag caacgcagcg 3960 

acagccgaaa tgacgaacgg cggtcaagcc gtgacgaacg aacagggtaa ggctaccgtc 4020 

acttatacca atacccgctc ctcgatagaa tcaggagcga gaccggatac cgttgaggcc 4080 

agtctggaaa atggtagctc cacgcttagc acatcaatta atgtcaacgc tgatgcgtct 4140 

acggcacatc tcaccttgct acaggcactt tttgatacag tctccgcagg cgagacaacc 4200 

agtctgtata ttgaggtgaa ggataattac ggcaacggtg tcccccagca ggaggtaacc 4260 

ctcagcgttt caccaagtga aggcgtgacc cccagtaata acgctatata tactaccaac 4320 

cacgacggca atttttacgc aagctttacc gctacaaaag ccggggttta tcaattgacg 4380 

gcaaccctcg aaaatggcga ttcgatgcaa caaacagtga cctatgtgcc gaacgtcgcg 4440 

aatgctgaaa tcacgctggc agcctcgaag gatccggtga ttgccgacaa taacgatctc 4500 

acgacactaa cagcaacagt cgctgataca gagggcaatg cgatagccaa cactgaggta 4560 

acatttactc tgccggaaga tgtgaaggcg aacttcacgc tgagcgatgg cggtaaagtg 4 620 

attactgatg ctgaaggcaa agcgaaagtc acgctgaaag gtacaaaagc aggcgctcat 4 680 

actgttacag catcgatgac tggcggtaag agtgagcagt tggtggtgaa ctttattgcg 4740 

gatacgctca ctgcgcaggt taatcttaac gttaccgagg acaattttat cgctaataac 4800 
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gtcgggatga ccaggctgca ggcaacagtg actgatggaa acggcaaccc gttagccaat 4860 

gaggcggtga cattcacgct accggcagat gtgagcgcaa gctttactct cggacaaggc 4 920 

ggttccgcca ttactgatat caacggcaag gctgaagtta cactgagcgg tacaaaatcc 4980 

ggcacctacG ccgtgacagt tagcgtgaac aattatggtg tcagtgatac gaaacaggtg 5040 

actttgattg ccgatgctgg taccgcaaaa ctagcctcct taacctctgt atactcattc 5100 

gtcgtcagca cgaccgaggg cgcaaccatg acggcaagcg tcactgacgc taacggcaac 5160 

ccggtagaag gcataaaagt taatttccgc ggaacctccg tcacgctaag cagcaccagc 5220 

gttgaaacgg atgatcgggg tttcgctgaa attcttgtga caagcaccga ggtcggactg 5280 

aaaacagttt cagcctctct ggcagataaa cctactgaag tcatctcgcg attactgaat 534 0 

gccagtgcag atgttaattc tgcgacgatt accagtctgg agataccgga aggtcaggta 5400 

atggtcgcac aagacgtagc agttaaagct cacgttaacg accagtttgg caacccggtt 54 60 

gcgcatcaac ccgtgacatt cagtgcagag ccatcctcgc aaatgatcat cagccagaat 5520 

acggtctcta ctaatacgca gggtgtagcc gaggtcacca tgacgcccga aagaaacggt 558 0 

tcgtatatgg tgaaagcatc cctgccgaat ggagcctcac ttgagaaaca actggaggct 5640 

attgatgaaa aactgacact cacggcgtcc agtccgctta tcggtgtcta tgcccctaca 5700 

ggcgctactc tgacggcaac gctaacctct gcaaatggca ctccagtgga gggtcaggtc 57 60 

atcaacttta gcgtaacgcc agaaggggcg acgttaagtg gcggaaaagt gagaactaac 5820 

tcttcaggtc aggctccagt cgttttgacc agcaataaag tcggtacata tacggtgact 5880 

gcatctttcc ataacggcgt aacaatacag acacagacaa ccgtgaaagt cactggcaac 5940 

tcaagcaccg cccatgttgc tagctttatc gctgatccat cgactatcgc cgccaccaac 6000 

actgatttaa gtaccttaaa ggcaacggtt gaggatggca gtggtaacct gatcgaaggt 6060 

ctcactgtgt acttcgcctt aaaaagcggc tctgccacat taacgtcatt aacagcggtg 6120 

accgatcaaa acggaatcgc gacaacaagc gtgaaaggag cgatgacagg tagcgtcacg 6180 

gtaagcgcag tcacgaccgc tggtggaatg caaacagtag atataacgct ggtggctggc 6240 

ccggcagaca cctcgcagtc cgtccttaag agcaatcggt catcactgaa aggggactat 6300 

accgatagtg ctgaattacg tcttgttctg cacgatatat caggcaatcc gatcaaagtt 6360 

tctgaaggga tggaatttgt gcaatcaggt actaacgtgc cctatataaa aattagcgca 6420 

attgattaca gtctaaatat caacggtgat tacaaagcca ctgttacagg aggcggagag 6480 

ggtatcgcaa cgctgatccc tgtattgaat ggtgttcatc aagctggtct gagtaccaca 6540 

atacaattca ctcgcgcaga agacaaaata atgagcggta cagtatcagt caatggtact 6600 

gacctaccga caactacatt cccttcgcag gggttcaccg gggcgtatta tcagttgaat 6660 

aatgacaact ttgccccagg aaaaacggcg gctgattatg agttttcaag ctctgcctcc 6720 

tgggtcgatg ttgatgctac cggtaaagtg acatttaaaa atgtcggcag caattcggaa 6780 

aggattacgg cgacgccaaa atcaggaggc cctagctatg tatacgaaat ccgtgtgaag 6840 

agttggtggg tgaacgccgg cgaggctttc atgatataca gccttgctga aaatttttgc 6900 

agcagcaatg gctacacgct ccccagagca aactatttaa accactgtag ttcccgaggc 6960 

atcgggtcac tgtacagtga atggggagat atggggcatt acacgactga cgctggtttt 7020 

caatcaaata tgtattggtc atctagtccc gcaaactcaa gcgaacaata cgtagtttcc 7080 

ctggcaacag gtgatcaaag cgtatttgaa aagcttgggt ttgcttatgc gacatgttat 7140 
aaaaacctgt ga 
<210> 143 

<211> 186 

<212> DNA 

<213> E. Coll 

<400> 143 

atgagcaaag gcgcattata tgaatttaac aatccagatc aactgaaaat acctctccct 60 

cataaacaca tagcgtcaac attcaatgac ataatgagta aagatgttgg ttatgcatac 120 

gtatcattac tctatgcctg tcccttaaaa acccactcat taagactgaa tccattcagc 180 

aaatga 3^gg 

<210> 144 
<211> 1197 
<212> DNA 
<213> E. Coli 



<400> 144 
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atgcaggtgg ctgaacagcg cattcagcta gctgaagccc aggcgaaggc agttgccact 60 

caggatggtc cgcagatcga cttttcggcg gatatggagc ggcaaaaaat gtcggcagaa 120 

ggcttaatgg ggccgtttgc tctgaacgat ccggccgcag gtacgaccgg cccgtggtac 180 

accaacggta cttttggctt aacggcgggc tggcatctcg atatctgggg aaagaatcgg 240 

gcggaggtta ctgcccgcct gggtacggtt aaagcacggg cggcggaacg cgagcaaacc 300 

cgccaattgc tggctggcag cgtagcccgc ctgtactggg agtggcaaac ccaggcggcg 360 

ttaaacacgg tcttgcagca aatagaaaaa gagcagaaca ccattatcgc gaccgatcgc 420 

cagctatatc agaacgggat tacttcttca gttgaaggtg tggaaaccga tattaatgcc 480 

agcaaaaccc ggcagcagct caacgatgtc gcggggaaaa tgaaaattat tgaggcacgg 54 0 

ttaagcgcac ttacaaataa ccagacaaag tcattgaagc ttaaaccggt cgcgttgccg 600 

aaagtggcaa gccagcttcc tgatgaactg gggtactcct tactggcccg gcgggcagat 660 

ttgcaggcgg cgcactggta cgttgagtca tcgctaagca ccattgatgc ggcaaaagcg 720 

gcattttatc ctgacatcaa cctgatggcc ttcctgcaac aggatgcgtt gcacttaagc 780 

gatctgttcc gtcattccgc gcagcaaatg ggcgttacgg caggcctgac gctacccatt 840 

ttcgatagtg gtcgtcttaa cgccaatctc gatatcgcaa aagccgaaag caacttgtct 900 

atcgccagct acaacaaagc ggtggttgaa gcggtgaatg acgtggcgcg ggcagccagt 960 

caggttcaga cactggcgga gaaaaaccag catcaggcgc aaattgagcg cgatgccttg 1020 

cgtgtggtag gtcttgcgca ggcgcgcttt aacgcgggca tcattgctgg ttcccgcgtc 1080 

agcgaagcca gaatccccgc gctgcgtgag cgggccaatg gcctgttatt gcaagggcag 1140 

tggctggatg cctccattca actcactggt gcgttgggcg gggggtacaa acgctga 1197 

<210> 145 
<211> 291 
<212> DNA 
<213> E. Coll 

<400> 145 

atgtattgcc acgcgaaact aaaaaatata tcgcaacaca cggtaatctc cgcgcacctt 60 

ttcttacctg attattcccc catgaatcgt gattcctttt atccagccat cgcctgtttt 120 

ccgctgttac tgatgctggc cgggtgtgcg cctatgcatg aaacccgcca ggcgttaagc 180 

cagcaaacgc ccgctgcaca agttgacacc gcattaccca cggcgctgaa aatggttggc 240 

cagacagcca atggtggctg gagtatcacg ataatcaact cacttcctta a 291 

<210> 146 
<211> 948 
<212> DNA 
<213> E. Coll 

<400> 146 

atgcgtgtgt tactggcacc gatggaggga gtgcttgact ctctggtgcg tgaattgctg 60 

accgaagtta acgactacga tctgtgcatc accgagtttg tccgcgtggt ggatcaactg 120 

ctgccggtaa aagtctttca tcgcatttgc cctgagctac aaaacgccag ccggacacca 180 

tctggtacgc tggtgcgcgt gcagttgtta ggtcagttcc cacaatggct ggcagagaac 240 

gccgcccgtg cggtggagtt aggttcctgg ggcgtggatc tcaattgcgg ctgcccgtcg 300 

aaaacggtta acggtagcgg cggcggggcg acgttactca aagatcctga actcatctac 360 

cagggtgcaa aagcgatgcg tgaagctgta ccggcgcatt tgcccgtcag cgtgaaagtg 420 

cgtctgggct gggacagcgg tgagaagaaa tttgaaatcg ccgatgcggt tcaacaggct 480 

ggcgctacgg agctggtggt gcatgggcgg acgaaagagc agggttaccg cgcggagcat 540 

attgactggc aggcgattgg cgatattcgc cagcggctga atattccggt gattgccaac 600 

ggtgaaatct gggactggca gagcgcgcaa caatgcatgg cgatcagcgg ctgcgacgca 660 

gtgatgattg gtcgcggggc gctcaatatt cccaacctga gccgggtggt aaaatataac 720 

gaaccgcgaa tgccgtggcc ggaggtggtt gctttgctgc aaaaatatac ccgtctggaa 780 

aagcagggcg ataccgggtt atatcacgtt gcgcggatta aacagtggtt gagttatttg 840 

cgtaaagaat acgatgaagc aacggaatta tttcagcatg ttcgggtgtt gaataattcc 900 

cctgatattg caagggctat tcaggcaatt gatatcgaga aactctaa 948 

<210> 147 
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<211> 891 
<212> DNA 
<213> E. Coli 



<400> 147 



atgacaatat 
ccagacaccg 
gatttaacga 
tattccgacc 
atagagcacc 
gcggcaatgc 
ctgttttatc 
gccgagcctg 
gtggtgccgg 
aaacatattc 
acagggaaca 
gatgcccagc 
gagaaactga 
gaagaagccc 
atgatggtga 



cgacaacttc cacgccgcat 
cgcgggattt tattgatatt 
cgcttaaact ggaaccaaac 
tcttgtggtc tgtgaaaacg 
aaagtaagcc ggaagaatta 
aaaaccatct tgatgcgggc 
atggttgcag aagtccttat 
ctatagcccg caaaatatat 
atgacgagat tatgcaacac 
gtcagcgcga tctgttggga 
ctaatgacag acagctaaaa 
gttttcgtgc atttattggt 
tgaccattgc tgacagatta 
tgcgtattgc tcaggagatg 
cccgactttc accagacgat 



gatgcggtat 
catcttcccg 
agttttattg 
caggagggag 
atggcttttc 
tataaagagc 
ccttattcac 
tcatcggctt 
cgcaaaatgg 
ttagtcgacc 
gccctgttta 
gagatagcgg 
cgtgaagaag 
ctggatagag 
cttatcgcgc 



ttaaatcttt 
cgccgctgcg 
atgaagacct 
tgggttatat 
gcatgatgcg 
ttccattggt 
tctgctggct 
ttccgttggt 
cgctgttgga 
aaattgtttc 
attacgtatt 
aacgcgcacc 
gcgcaatgca 
gtttagacag 
aaagccacta 



a 



tttacgccat 
caaactgtgt 
gcggcaatat 
ttatgtagtg 
ttattccatt 
gctcccgatg 
tgatgaattt 
ggatattacc 
gttaattcag 
gctgctagtt 
acaaacaggg 
acaagaaaag 
gggcaaacac 
agagttagtt 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
891 



<210> 148 
<211> 1668 
<212> DNA 
<213> E. Coli 

<400> 148 

gtggctcaat tcgtttatac catgcatcgt gtcggcaaag ttgttccgcc gaaacgtcat 60 

attttgaaaa acatctctct gagtttcttc cctggggcaa aaattggtgt cctgggtctg 120 

aatggcgcgg gtaagtccac cctgctgcgc attatggcgg gcattgataa agacatcgaa 180 

ggtgaagcgc gtccgcagcc agacatcaag attggttatc tgccgcagga accgcagctg 240 

aacccggaac acaccgtgcg tgagtccatt gaagaagcgg tttcagaagt ggttaacgcc 300 

ctgaaacgcc tggatgaagt gtatgcgctg tacgccgatc cggatgccga ttttgacaag 360 

ctggccgctg aacaaggccg tctggaagag atcattcagg ctcacgacgg tcataatctg 420 

aacgtacagc tggagcgtgc ggcggatgcg ctacgtctgc cggactggga cgcgaaaatc 480 

gctaacctct ccggtggtga acgtcgtcgc gtagcgttgt gccgcctgct gctggaaaaa 540 

ccagacatgc tgctgctcga cgaaccgacc aaccacctgg atgccgaatc cgtggcctgg 600 

ctggaacgct tcctgcacga cttcgaaggc accgttgtgg cgattaccca cgaccgttac 660 

ttcctcgata acgttgcagg ctggatcctc gaacttgacc gcggtgaagg tattccgtgg 720 

gaaggtaact actcctcctg gctggagcag aaagatcagc gcctggcgca ggaagcttca 780 

caagaagcgg cgcgtcgtaa gtcgattgag aaagagctgg aatgggtacg tcaaggtact 840 

aaaggccgtc agtcgaaagg taaagcacgt ctggcgcgct ttgaagaact gaacagcacc 900 

gaatatcaga aacgtaacga aaccaacgaa ctgtttattc cacctggacc gcgtctgggc 960 

gataaagtgc tggaagtcag caacctgcgt aaatcctatg gcgatcgtct gctgattgat 1020 

gacctgagct tctcgatccc gaaaggagcg atcgtcggga tcatcggtcc gaacggtgcg 1080 

ggtaaatcga ccctgttccg tatgatctct ggtcaggaac agccggacag cggcaccatc 1140 

actttgggtg aaacggtgaa actggcgtcg gttgatcagt tccgtgactc aatggataac 1200 

agcaaaaccg tttgggaaga agtttccggc gggctggata tcatgaagat cggcaacacc 1260 

gagatgccaa gccgcgccta cgttggccgc tttaacttta aaggggttga tcagggtaaa 1320 

cgcgttggtg aactctccgg tggtgagcgc ggtcgtctgc atctggcgaa gctgctgcag 1380 

gttggcggca acatgctgct gctcgacgaa ccaaccaacg acctggatat cgaaaccctg 1440 

cgcgcgctgg aaaacgccct gctggagttc ccgggctgtg cgatggttat ctcgcacgac 1500 

cgttggttcc tcgaccgtat cgccacgcac attctggatt accaggatga aggtaaagtt 1560 

gagttcttcg aaggtaactt taccgagtac gaagagtaca agaaacgcac gctgggcgca 1620 

gacgcgctgg agccgaagcg tatcaagtac aagcgtattg cgaagtaa 1668 
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<210> 149 
<211> 522 
<212> DNA 
<213> E. Coli 

<400> 149 

atgtcaaagc caaaataccc ttttgaaaag cgccttgaag tcgtgaatca ctacttcaca 60 

actgatgatg gttacaggat catctcggca cgttttggtg tcccccgaac ccaggtcagg 120 

acatgggttg ccctctatga aaaacatgga gaaaaaggtt taattcccaa acctaaaggc 180 

gttagtgctg atccagagtt gcgtattaag gtcgtgaaag ctgtgatcga gcagcacatg 240 

tcccttaatc aggctgctgc tcactttatg cttgctggta gtggttctgt agccaggtgg 300 

ctgaaggtct atgaagagcg cggagaagct ggtttacgcg cgctcaagat tggcaccaaa 360 

agaaacattg caatatcagt tgatccagaa aaagcggcat cagcattgga gctgtcaaaa 420 

gaccgacgca ttgaggatct tgaaaggcaa gttcgatttc ttgaaacgcg gcttatgtat 480 

ctaaaaaagc tgaaagcctt agctcatccc acgaaaaagt ga 522 

<210> 150 

<211> 852 

<212> DNA 

<213> E. Coli 

<400> 150 

gtgaaagtac tcaacgagct aaggcagttt tatcctcttg atgagcttct cagggctgcg 60 

gagataccgc gcagtacgtt ttattatcat ctaaaggctc tcagcaagcc tgacaagtat 120 

gcggacgtta aaaagcgtat tagtgagatt tatcacgaga atagaggccg atacggatac 180 

cgtagggtaa cgctgtctct tcatcgagaa gggaaacaga ttaaccataa agctgttcag 240 

cgcctgatgg gaaccctctc acttaaagca gcgattaagg tcaagcgata ccgctcttac 300 

agaggagagg tagggcaaac cgcccctaat gttctccaaa gagatttcaa ggctacgcgg 360 

ccaaacgaga agtgggttac cgatgttact gaatttgcag tcaatgggcg caagctgtat 420 

ttgtctccag taatagatct cttcaacaac gaagttattt cttacagcct ttcggaaaga 480 

ccagtgatga acatggttga gaatatgctc gatcaggcat tcaaaaagct taatcctcac 540 

gagcatcctg ttctgcactc tgaccaggga tggcagtatc gtatgagaag atatcaaaat 600 

atccttaaag aacatggtat taaacaaagc atgtccagaa aaggcaattg tctggataat 660 

gctgtggtgg agtgtttctt tggaacctta aagtcggagt gtttttatct tgatgagttc 720 

agtaatataa gcgaactgaa ggatgctgtt acggaatata ttgaatacta caacagcaga 780 

agaattagcc tgaaattaaa aggtctgact ccaattgaat atcggaatca gacctatatg 840 

cctcgtgttt aa 352 

<210> 151 
<211> 117 
<212> DNA 
<213> E. Coli 

<400> 151 

atgaaagttc gtgcttccgt caagaaatta tgccgtaact gcaaaatcgt taagcgtgat 60 
ggtgtcatcc gtgtgatttg cagtgccgag ccgaagcata aacagcgcca aggctga 117 

<210> 152 
<211> 1332 
<212> DNA 
<213> E, Coli 

<400> 152 

atggctaaac aaccgggatt agattttcaa agtgccaaag gtggcttagg cgagctgaaa 60 

cgcagactgc tgtttgttat cggtgcgctg attgtgttcc gtattggctc ttttattccg 120 

atccctggta ttgatgccgc tgtacttgcc aaactgcttg agcaacagcg aggcaccatc 180 

attgagatgt ttaacatgtt ctctggtggt gctctcagcc gtgcttctat ctttgctctg 240 
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gggatcatgc cgtatatttc ggcgtcgatc attatccagc tgctgacggt ggttcaccca 300 

acgttggcag aaattaagaa agaaggggag tctggtcgtc gtaagatcag ccagtacacc 360 

cgctacggta ctctggtgct ggcaatattc cagtcgatcg gtattgctac cggtctgccg 420 

aatatgcctg gtatgcaagg cctggtgatt aacccgggct ttgcattcta cttcaccgct 4 80 

gttgtaagtc tggtcacagg aaccatgttc ctgatgtggt tgggcgaaca gattactgaa 540 

cgaggtatcg gcaacggtat ttcaatcatt atcttcgccg gtattgtcgc gggactcccg 600 

ccagccattg cccatactat cgagcaagcg cgtcaaggcg acctgcactt cctcgtgttg 660 

ctgttggttg cagtattagt atttgcagtg acgttctttg ttgtatttgt tgagcgtggt 720 

caacgccgca ttgtggtaaa ctacgcgaaa cgtcagcaag gtcgtcgtgt ctatgctgca 780 

cagagcacac atttaccgct gaaagtgaat atggcggggg taatcccggc aatcttcgct 840 

tccagtatta ttctgttccc ggcgaccatc gcgtcatggt tcgggggcgg tactggttgg 900 

aactggctga caacaatttc gctgtatttg cagcctgggc aaccgcttta tgtgttactc 960 

tatgcgtctg caatcatctt cttctgtttc ttctacacgg cgttggtttt caacccgcgt 1020 

gaaacagcag ataacctgaa gaagtccggt gcatttgtac caggaattcg tccgggagag 1080 

caaacggcga agtatatcga taaagtaatg acccgcctga ccctggttgg tgcgctgtat 1140 

attaccttta tctgcctgat cccggagttc atgcgtgatg caatgaaagt accgttctac 1200 

ttcggtggga cctcactgct tatcgttgtt gtcgtgatta tggactttat ggctcaagtg 1260 

caaactctga tgatgtccag tcagtatgag tctgcattga agaaggcgaa cctgaaaggc 1320 

tacggccgat aa 1332 

<210> 153 
<211> 435 
<212> DNA 
<213> E. Coli 

<400> 153 

atgcgtttaa atactctgtc tccggccgaa ggctccaaaa aggcgggtaa acgcctgggt 60 

cgtggtatcg gttctggcct cggtaaaacc ggtggtcgtg gtcacaaagg tcagaagtct 120 

cgttctggcg gtggcgtacg tcgcggtttc gagggtggtc agatgcctct gtaccgtcgt 180 

ctgccgaaat tcggcttcac ttctcgtaaa gcagcgatta cagccgaaat tcgtctgtct 240 

gacctggcta aagtagaagg cggtgtagta gacctgaaca cgctgaaagc ggctaacatt 300 

atcggtatcc agatcgagtt cgcgaaagtg atcctggctg gcgaagtaac gactccggta 360 

actgttcgtg gcctgcgtgt tactaaaggc gctcgtgctg ctatcgaagc tgctggcggt 420 

aaaatcgagg aataa 435 

<210> 154 
<211> 180 
<212> DNA 
<213> E. Coli 

<400> 154 

atggcaaaga ctattaaaat tactcaaacc cgcagtgcaa tcggtcgtct gccgaaacac 60 

aaggcaacgc tgcttggcct gggtctgcgt cgtattggtc acaccgtaga gcgcgaggat 120 

actcctgcta ttcgcggtat gatcaacgcg gtttccttca tggttaaagt tgaggagtaa 180 

<210> 155 
<211> 504 
<212> DNA 
<213> E. Coli 

<400> 155 

atggctcaca tcgaaaaaca agctggcgaa ctgcaggaaa agctgatcgc ggtaaaccgc 60 

gtatctaaaa ccgttaaagg tggtcgtatt ttctccttca cagctctgac tgtagttggc 120 

gatggtaacg gtcgcgttgg ttttggttac ggtaaagcgc gtgaagttcc agcagcgatc 180 

cagaaagcga tggaaaaagc ccgtcgcaat atgattaacg tcgcgctgaa taacggcact 240 

ctgcaacacc ctgttaaagg tgttcacacg ggttctcgcg tattcatgca gccggcttcc 300 

gaaggtaccg gtatcatcgc cggtggtgca atgcgcgccg ttctggaagt cgctggggtt 360 
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cataacgttc tggctaaagc ctatggttcc accaacccga tcaacgtggt tcgtgcaact 
attgatggcc tggaaaatat gaattctcca gaaatggtcg ctgccaagcg tggtaaatcc 
gttgaagaaa ttctggggaa ataa 



420 
480 
504 



<210> 156 
<211> 354 
<212> DNA 
<213> E. Coli 

<400> 156 

atggataaga aatctgctcg tatccgtcgt gcgacccgcg cacgccgcaa gctccaggag 60 

ctgggcgcaa ctcgcctggt ggtacatcgt accccgcgtc acatttacgc acaggtaatt 120 

gcaccgaacg gttctgaagt tctggtagct gcttctactg tagaaaaagc tatcgctgaa 180 

caactgaagt acaccggtaa caaagacgcg gctgcagctg tgggtaaagc tgtcgctgaa 240 

cgcgctctgg aaaaaggcat caaagatgta tcctttgacc gttccgggtt ccaatatcat 300 

ggtcgtgtcc aggcactggc agatgctgcc cgtgaagctg gccttcagtt ctaa 354 

<210> 157 
<211> 534 
<212> DNA 
<213> E. Coli 

<400> 157 

atgtctcgtg ttgctaaagc accggtcgtt gttcctgccg gcgttgacgt aaaaatcaac 60 

ggtcaggtta ttacgatcaa aggtaaaaac ggcgagctga ctcgtactct caacgatgct 120 

gttgaagtta aacatgcaga taataccctg accttcggtc cgcgtgatgg ttacgcagac 180 

ggttgggcac aggctggtac cgcgcgtgcc ctgctgaact caatggttat cggtgttacc 240 

gaaggcttca ctaagaagct gcagctggtt ggtgtaggtt accgtgcagc ggttaaaggc 300 

aatgtgatta acctgtctct gggtttctct catcctgttg accatcagct gcctgcgggt 360 

atcactgctg aatgtccgac tcagactgaa atcgtgctga aaggcgctga taagcaggtg 420 

atcggccagg ttgcagcgga tctgcgcgcc taccgtcgtc ctgagcctta taaaggcaag 480 

ggtgttcgtt acgccgacga agtcgtgcgt accaaagagg ctaagaagaa gtaa 534 

<210> 158 
<211> 393 
<212> DNA 
<213> E. Coli 

<400> 158 

atgagcatgc aagatccgat cgcggatatg ctgacccgta tccgtaacgg tcaggccgcg 60 

aacaaagctg cggtcaccat gccttcctcc aagctgaaag tggcaatcgc caacgtgctg 120 

aaggaagaag gttttattga agattttaaa gttgaaggcg acaccaagcc tgaactggaa 180 

cttactctga agtatttcca gggcaaagct gttgtagaaa gcattcagcg tgtcagccgc 240 

ccaggtctgc gcatctataa acgtaaagat gagctgccga aagttatggc gggtctgggt 300 

atcgcagttg tttctacctc taaaggtgtt atgactgatc gtgcagcgcg ccaggctggt 360 

cttggtggcg aaattatctg ctacgtagcc taa 393 

<210> 159 
<211> 306 
<212> DNA 
<213> E. Coli 

<400> 159 

atggctaagc aatcaatgaa agcacgcgaa gtaaaacgcg tagctttagc tgataaatac 60 

ttcgcgaaac gcgctgaact gaaagcgatc atctctgatg tgaacgcttc cgacgaagat 120 

cgttggaacg ctgttctcaa gctgcagact ctgccgcgtg attccagccc gtctcgtcag 180 

cgtaaccgct gccgtcaaac aggtcgtccg catggtttcc tgcggaagtt cgggttgagc 240 
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cgtattaagg tccgtgaagc cgctatgcgc ggtgaaatcc cgggtctgaa aaaggctagc 
tggtaa 



300 
306 



<210> 160 
<211> 540 
<212> DNA 
<213> E. Coli 

<400> 160 

atggcgaaac tgcatgatta ctacaaagac gaagtagtta aaaaactcat gactgagttt 60 

aactacaatt ctgtcatgca agtccctcgg gtcgagaaga tcaccctgaa catgggtgtt 120 

ggtgaagcga tcgctgacaa aaaactgctg gataacgcag cagcagacct ggcagcaatc 180 

tccggtcaaa aaccgctgat caccaaagca cgcaaatctg ttgcaggctt caaaatccgt 240 

cagggctatc cgatcggctg taaagtaact ctgcgtggcg aacgcatgtg ggagttcttt 300 

gagcgcctga tcactattgc tgtacctcgt atccgtgact tccgtggcct gtccgctaag 360 

tctttcgacg gtcgtggtaa ctacagcatg ggtgtccgtg agcagatcat cttcccagaa 420 

atcgactacg ataaagtcga ccgcgttcgt ggtttggata ttaccattac cactactgcg 480 

aaatctgacg aagaaggccg cgctctgctg gctgcctttg acttcccgtt ccgcaagtaa 540 

<210> 161 

<211> 315 

<212> DNA 

<213> E. Coli 

<400> 161 

atggcagcga aaatccgtcg tgatgacgaa gttatcgtgt taaccggtaa agataaaggt 60 

aaacgcggta aagttaagaa tgtcctgtct tccggcaagg tcattgttga aggtatcaac 120 

ctggttaaga aacatcagaa gccggttccg gccctgaacc aaccgggtgg catcgttgaa 180 

aaagaagccg ctattcaggt ttccaacgta gcaatcttca atgcggcaac cggcaaggct 240 

gaccgtgtag gctttagatt cgaagacggt aaaaaagtcc gtttcttcaa gtctaacagc 300 

gaaactatca agtaa 215 

<210> 162 
<211> 372 
<212> DNA 
<213> E. Coli 

<400> 162 

atgatccaag aacagactat gctgaacgtc gccgacaact ccggtgcacg tcgcgtaatg 60 

tgtatcaagg ttctgggtgg ctcgcaccgt cgctacgcag gcgtaggcga catcatcaag 120 

atcaccatca aagaagcaat tccgcgtggt aaggtcaaaa aaggtgatgt gctgaaggcg 180 

gtagtggtgc gcaccaagaa gggtgttcgt cgcccggacg gttctgtcat tcgcttcgat 240 

ggtaatgctt gtgttcttct gaacaacaac agcgagcagc ctatcggtac gcgtattttt 300 

gggccggtaa ctcgtgagct tcgtagtgag aagttcatga aaattatctc tctggcacca 360 

gaagtactct aa 372 

<210> 163 
<211> 567 
<212> DNA 
<213> E. Coli 

<400> 163 

atgtttaaag gacaaaaaac attggccgca ctggccgtat ctctgctgtt cactgcacct 60 

gtttatgctg ctgatgaagg ttctggcgaa attcacttta agggggaggt tattgaagca 120 

ccttgtgaaa ttcatccaga agatattgat aaaaacatag atcttggaca agtcacgaca 180 

acccatataa accgggagca tcatagcaat aaagtggccg tcgacattcg cttgatcaac 240 

tgtgatctgc ctgcttctga caacggtagc ggaatgccgg tatccaaagt tggcgtaacc 300 
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ttcgatagca cggctaagac aactggtgct acgcctttgt tgagcaacac cagtgcaggc 
gaagcaactg gggtcggtgt acgactgatg gacaaaaatg acggtaacat cgtattaggt 
tcagccgcgc cagatcttga cctggatgca agctcatcag aacagacgct gaactttttc 
gcctggatgg aacaaattga taatgcagtc gatgtcacgg caggtgaagt aaccgctaac 
gcaacctacg tgctggatta taaataa 



360 
420 
480 
540 
567 



<210> 164 
<211> 1284 
<212> DNA 
<213> E. Coli 

<400> 164 

atggctgata caaaagcaaa actcaccctc aacggggata cagctgttga actggatgtg 60 

ctgaaaggca cgctgggtca agatgttatt gatatccgta ctctcggttc aaaaggtgtg 120 

ttcacctttg acccaggctt cacttcaacc gcatcctgcg aatctaaaat tacttttatt 180 

gatggtgatg aaggtatttt gctgcaccgc ggtttcccga tcgatcagct ggcgaccgat 240 

tctaactacc tggaagtttg ttacatcctg ctgaatggtg aaaaaccgac tcaggaacag 300 

tatgacgaat ttaaaactac ggtgacccgt cataccatga tccacgagca gattacccgt 360 

ctgttccatg ctttccgtcg cgactcgcat ccaatggcag tcatgtgtgg tattaccggc 420 

gcgctggcgg cgttctatca cgactcgctg gatgttaaca atcctcgtca ccgtgaaatt 4 80 

gccgcgttcc gcctgctgtc gaaaatgccg accatggccg cgatgtgtta caagtattcc 540 

attggtcagc catttgttta cccgcgcaac gatctctcct acgccggtaa cttcctgaat 600 

atgatgttct ccacgccgtg cgaaccgtat gaagttaatc cgattctgga acgtgctatg 660 

gaccgtattc tgatcctgca cgctgaccat gaacagaacg cctctacctc caccgtgcgt 720 

accgctggct cttcgggtgc gaacccgttt gcctgtatcg cagcaggtat tgcttcactg 780 

tggggacctg cgcacggcgg tgctaacgaa gcggcgctga aaatgctgga agaaatcagc 840 

tccgttaaac acattccgga atttgttcgt cgtgcgaaag acaaaaatga ttctttccgc 900 

ctgatgggct tcggtcaccg cgtgtacaaa aattacgacc cgcgcgccac cgtaatgcgt 960 

gaaacctgcc atgaagtgct gaaagagctg ggcacgaagg atgacctgct ggaagtggct 1020 

atggagctgg aaaacatcgc gctgaacgac ccgtacttta tcgagaagaa actgtacccg 1080 

aacgtcgatt tctactctgg tatcatcctg aaagcgatgg gtattccgtc ttccatgttc 1140 

accgtcattt tcgcaatggc acgtaccgtt ggctggatcg cccactggag cgaaatgcac 1200 

agtgacggta tgaagattgc ccgtccgcgt cagctgtata caggatatga aaaacgcgac 1260 

tttaaaagcg atatcaagcg ttaa 1284 

<210> 165 
<211> 1434 
<212> DNA 
<213> E. Coli 



<400> 165 



atgaaagtaa 
ctggatcgtt 
gttaaagtga 
gcttctctcg 
gcgctgagta 
catccgacca 
tttgaagaag 
ctgagttcga 
cagcagatga 
accgattttg 
gctgttgtcg 
gccgattacg 
caaccgggta 
ggtgcgggcg 
gaagaagcct 
tccacggttt 



cgctgccaga gtttgaacgt gcaggagtga tggtggttgg tgatgtgatg 
actggtacgg ccccaccagt cgtatctcgc cggaagcgcc ggtgcccgtg 
ataccatcga agaacgtccg ggcggcgcgg ctaacgtggc gatgaatatc 
gtgctaatgc acgcctggtc gggttgacgg gcattgacga tgcagcgcgc 
aatctctggc cgacgtcaac gtcaaatgcg acttcgtttc tgtaccgacg 
ttaccaaatt acgggtactt tcccgcaacc aacagctgat ccgtctggat 
gtttcgaagg tgttgatccg cagccgctgc acgagcggat taatcaggcg 
ttggcgcgct ggtgctttct gactacgcca aaggtgcgct ggcaagcgta 
tccaactggc gcgtaaagcg ggtgttccgg tgctgattga tccaaaaggt 
agcgctaccg cggcgctacg ctgttaacgc cgaatctctc ggaatttgaa 
gtaaatgtaa gaccgaagaa gagattgttg agcgcggcat gaaactgatt 
aactctcggc tctgttagtg acccgttccg aacagggtat gtcgctgctg 
aagcgccgct gcatatgcca acccaagcgc aggaagtgta tgacgttacc 
acacggtgat tggcgtcctg gcggcaacgc tggcagcggg taattcgctg 
gcttctttgc caatgcggcg gctggcgtgg tggtcggcaa actgggaacc 
cgccgatcga gctggaaaat gctgtacgtg gacgtgcaga tacaggcttt 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
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ggcgtgatga ccgaagagga actgaagctg 
aaagtggtga tgaccaacgg tgtctttgac 
gcaaatgccc gcaagctggg tgaccgcttg 
aaacggctga aaggggattc ccgcccggta 
ggcgcactgg aagcggtcga ctgggtagtg 
atcgccggga tcttgccaga tctgctggtg 
gccgggagta aagaagtctg ggccaacggt 
ggttgctcga cgaccaacat catcaagaag 

<210> 166 
<211> 2841 
<212> DNA 
<213> E. Coli 

<400> 166 

atgaagccgc tctcttcacc gttacagcag tactggcaga ccgttgttga gcggctgcca 60 

gagcctttag ccgaggaatc acttagcgca caggcgaagt cagtacttac ttttagtgat 120 

tttgtgcagg acagcgtgat tgcgcatcca gagtggctga cggaactgga aagccaaccg 180 

ccgcaggccg acgaatggca gcattacgcg gcatggttgc aggaggcgct ctgtaatgtg 240 

agtgacgaag ccgggttaat gcgcgagctg cggctattcc ggcggcgcat tatggtgcgc 300 

atcgcctggg cgcaaacgct ggcactggtt actgaagaga gcatattgca gcagctcagc 360 

tatctggcgg agacgctgat tgttgcggcg cgtgactggc tgtatgacgc ctgctgccgc 420 

gagtggggaa cgccgtgcaa tgcgcagggc gaagcgcaac cgctgctgat tttaggcatg 480 

ggtaagctgg gcggtgggga gctgaatttc tcctctgata tcgatctgat ttttgcctgg 540 

ccggaacatg gttgtacgca gggtggacgc cgggaactgg ataacgcgca gttttttacc 600 

cgcatggggc agcggctgat taaagtgctg gatcaaccaa cgcaggatgg cttcgtctat 660 

cgcgtggata tgcggctgcg tccgtttggc gaaagtggcc cgctggtgct gagctttgcc 720 

gcgttggaag attattacca ggagcagggg cgcgactggg agcgttacgc gatggtcaag 780 

gcgcggatta tgggcgatag cgaaggcgtc tatgctaacg agttgcgtgc gatgctgcgc 840 

ccgtttgttt tccgtcgtta catcgatttc agcgtgattc agtcgctgcg caacatgaaa 900 

gggatgattg cccgtgaagt gcgtcgacgt ggtttgaccg acaatatcaa actcggcgca 960 

ggcggcattc gcgaaattga atttatcgtt caggtgttcc agctcattcg cggcggacgc 1020 

gaaccgtcgc tgcaatcgcg ctctttactg ccaacgctca gcgccattgc cgagctgcat 1080 

ctgctttctg aaaacgatgc tgaacaattg cgagtggcgt atctgttcct gcggcgtctg 1140 

gaaaacctgc tgcaaagcat taacgacgaa caaacccaga cgcttccttc tgatgagctt 1200 

aatcgtgcgc ggctggcgtg ggcgatggac tttgctgact ggccgcaact gaccggggcg 1260 

ctgaccgcac atatgaccaa tgtgcgccgg gtgtttaatg aattgattgg cgacgatgaa 1320 

agtgaaactc aggaagagtc gctgtcggaa cagtggcgtg agctgtggca ggatgcgttg 1380 

caggaagatg acactacgcc agtgctggcg catcttagcg aggatgatcg caaacaggtg 1440 

ctaacgctga ttgccgattt ccgcaaagag ctggataagc gcaccatcgg gccgcgagga 1500 

cgtcaggtgc tcgaccatct gatgccgcat ctgctaagtg atgtctgtgc gcgtgaagac 1560 

gctgccgtta cgctgtcgcg cattaccgcc ttgctggtgg ggattgttac ccgcaccacc 1620 

tatttagaat tgctcagtga attccccgcg gcgcttaaac atttgatttc tctgtgtgcc 1680 

gcgtcgccga tgattgccag ccagctggcg cgttatccat tattgctgga tgaattgctc 1740 

gatccaaaca ccctttacca gccgacggcg accgatgcct accgcgatga gttgcgccag 1800 

tatttgctgc gcgtgccgga agatgacgaa gagcaacagc ttgaggcgct gcgtcagttc 1860 

aaacaggcgc agctgttacg catcgccgca gcggatatcg ccggtacgct accggtgatg 1920 

aaagtgagcg atcacttaac ctggctggcg gaagccatga tagatgccgt cgttcagcag 1980 

gcgtgggttc aaatggttgc ccgctacggt aagccgaatc acctgaacga acgcgaaggg 2040 

cgtggttttg cggtggtcgg ctacggcaag ctgggcggct gggagttagg ctacagttcc 2100 

gatcttgacc ttatcttcct ccatgattgc ccaatggatg cgatgactga cggtgagcgg 2160 

gaaatcgacg ggcggcagtt ttatctgcgt ctggcgcaac gcattatgca tctgttcagt 2220 

acgcgtacct cttccggcat tttgtatgaa gtggatgctc gactgcgtcc gtccggggcg 2280 

gcgggaatgc tggtgacatc cgcagaagca tttgccgatt atcagaaaaa cgaggcctgg 2340 

acgtgggaac atcaggcgct ggtgcgtgcg cgtgtagtgt acggcgatcc gcagctcacc 2400 

gcgcactttg acgcagtgcg tcgcgagatt atgacgctgc cgcgtgaagg taaaactctg 24 60 

caaacggaag tgcgggaaat gcgcgagaaa atgcgcgctc atctcggcaa taaacatcgc 2520 



gccgtagcgg cagcgcgtaa 
atcctgcacg ccgggcacgt 
attgttgccg tcaacagcga 
aacccactcg aacagcgtat 
tcgtttgaag aggacacgcc 
aaaggcggcg actataaacc 
ggcgaagtgt tggtgctcaa 
atccaacagg ataaaaaagg 



acgtggtgaa 1020 

ctcttatctg 1080 

tgcctccacc 1140 

gattgtgctg 1200 

gcagcgcttg 1260 

agaagagatt 1320 

ctttgaagac 1380 

ctaa 1434 
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gatcgctttg atatcaaagc tgatgaaggg ggaattaccg atatcgaatt tattacccaa 2580 

tatctggtgt tgcgctacgc tcatgaaaaa ccgaagttaa cgcgctggtc agacaacgtg 2640 

cgtattctgg aactactggc gcaaaacgac attatggaag agcaggaagc gatggcgctg 2700 

acccgtgctt acactacgct tcgcgatgaa cttcatcatc tggcattaca ggaattgccg 2760 

ggccatgtgt cggaggattg cttcaccgca gagcgtgaac tggtgcgggc aagctggcag 2820 

aagtggctgg tggaagaatg a 2841 

<210> 167 
<211> 1302 
<212> DNA 
<213> E. Coli 

<400> 167 

atggctcagg aaatcgaatt aaagtttatt gttaatcaca gtgccgttga ggcgttgcgt 60 

gaccatctca atacgctggg cggcgagcac catgaccccg tgcagttgct gaatatttac 120 

tacgaaacgc cggataactg gctgcgtggg cacgatatgg gcttacgtat tcgtggcgaa 180 

aacggtcgct atgagatgac catgaaagtt gcaggaagag tgacaggcgg cttacatcag 240 

cgcccggaat ataacgtggc gttgagcgaa ccgacgctcg acctggcgca gttaccgacg 300 

gaagtctggc cgaacggcga attgcccgcc gatctcgcct cccgcgtgca gccgctgttc 360 

agcaccgatt tttatcgcga aaaatggctg gtggcggtcg atggtagcca aattgaaatc 420 

gccctcgacc agggggaagt gaaagcgggt gaatttgctg aacctatctg tgagctggaa 4 80 

ctggaactgc ttagcggcga cacgcgcgcg gtgctgaaac tggcgaacca actggtatcg 540 

caaaccggat tacgccaggg cagcctgagc aaagcggcgc gtggctatca tctggcgcag 600 

ggcaatccgg cgcgtgaaat caaaccgacc accattttgc atgttgcggc aaaagccgat 660 

gtggaacagg ggctggaagc ggcgctcgag ctggcgttag cgcaatggca gtatcatgaa 720 

gaactgtggg tacgcggcaa cgatgcggcg aaagaacagg tgctggcagc cattagcctg 780 

gtccgtcata cgctgatgct gttcggtggt attgtgccgc gtaaagcgag cactcactta 840 

cgtgatctgc tgactcaatg cgaggcgacc attgcttctg cggtgtctgc cgtgacggcg 900 

gtctactcta ccgaaacggc aatggcgaag ctggcgttga ccgaatggtt ggtaagcaaa 960 

gcatggcagc catttttaga tgccaaagcg cagggcaaaa tcagcgactc cttcaaacgc 1020 

tttgccgata tccatctttc ccgccatgcc gctgaactga aaagcgtttt ctgccagccg 1080 

ttaggcgatc gctaccgtga ccagttgcca cgcctgacgc gtgatattga ctcaatactg 1140 

ttgctggcgg gttactatga tcctgtcgtc gcgcaagcct ggctggagaa ctggcagggg 1200 

ctgcatcacg ctattgcgac cgggcaacgc atcgaaattg aacatttccg taatgaggca 1260 

aacaatcagg aaccgttctg gttgcacagc ggaaaacgtt aa 1302 

<210> 168 
<211> 213 
<212> DNA 
<213> E. Coli 

<400> 168 

atgtccggta aaatgactgg tatcgtaaaa tggttcaacg ctgacaaagg cttcggcttc 60 

atcactcctg acgatggctc taaagatgtg ttcgtacact tctctgctat ccagaacgat 120 

ggttacaaat ctctggacga aggtcagaaa gtgtccttca ccatcgaaag cggcgctaaa 180 

ggcccggcag ctggtaacgt aaccagcctg taa 213 

<210> 169 
<211> 1572 
<212> DNA 
<213> E. Coli 

<400> 169 

atgagggaca ttgtggaccc tgtattctct atcggtatct catcattatg ggatgagctg 60 

cgacatatgc cagcaggcgg cgtctggtgg tttaacgtcg atcgccatga agatgctatc 120 

agtctggcga atcaaacaat tgcatcccag gctgaaaccg cacacgtcgc ggtcattagc 180 

atggacagcg atccggcgaa aatctttcaa ttagatgatt ctcaagggcc ggaaaaaata 240 
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aaattatttt 
tgttctattg 
aacattcctg 
aaccattgtt 
ttgcttgagg 
catttgctgg 
agcgttcagc 
ccacgcagcg 
ctatcggaac 
gctcaggcgg 
cgcagcattc 
aataccgcta 
gttattccgt 
cagaagttta 
ctcaaactgc 
aataaccctc 
ggtatccgcg 
accattggcg 
accgcgttga 
tggtttgaag 
caatggggca 
gatggtcgcc 
cgctcatcat 



caatgctaaa 
atccccataa 
ccgagcggct 
cgcttttggt 
aataccgttc 
atattgcctt 
aacaaaatgg 
acgaaaaacg 
actggcaact 
cgacggtggt 
ataccctgcg 
gcctgcgcgc 
ggaatgcgcc 
gtcgctatgt 
gtggtttcca 
tattacctgc 
ttgaacaagc 
gtaatcggct 
atcatatttt 
atgatcaaat 
tgccgctgcc 
actggcgacg 
ga 



tcatgaaaaa 
ttaccttttt 
tcgctcatgg 
aattaatccc 
actttttggt 
ctggtgcaac 
tatctggaca 
cattctgagt 
gttcaacaat 
cttttcttta 
tcgccagcgc 
caccgatgaa 
actctcccgt 
gccggaagat 
gaagtgggat 
ccacggtaaa 
cctgacgctg 
ggtgctgttt 
cccattgcct 
cagtgccgag 
tttaacgcaa 
aataccagaa 



ggtctatact 
attcttgttt 
ttggataaaa 
ggaaataata 
cttgccagtt 
gaaaaagggg 
ttagttcaaa 
aatgttgctg 
aacgaagtcc 
cagcaaaatg 
ggtagtgcga 
cgtttgttat 
tgtctgacga 
atcactacct 
gtgttctgta 
ggcgttctgg 
tgtcgcccta 
ctctcattct 
actggcgaca 
ctggtgcaga 
agttctaaac 
cccatgcgac 



atttgacccg 
gcgcaaataa 
tgaataaatg 
acgataaaca 
tgcgttttca 
tcagcgcccg 
gcgaagaggc 
tactggaagg 
tgttcaatga 
cgcaaatcga 
tgaaaatcct 
tggcctgcgg 
tgatcgaaag 
tgctgtcaat 
atgccgtcaa 
ttgccctacg 
accgtaccgg 
gtcggattaa 
ttttctcaaa 
tgcgcttgct 
cggtcatcaa 
tgttagatga 



tgatttgcag 
cgcatggcaa 
gagcaggtta 
attttcattg 
gggtgaccaa 
tcagcagctt 
ggagatccaa 
tgcgccgccg 
agcccgtacc 
gccactggcc 
cgtgcgggaa 
tgcaaatatg 
cgtgcaaggg 
gacccagccg 
caacatgatg 
tccggtaccg 
cgatatcatg 
cgatctggat 
ccgtatggtc 
tgccccagaa 
tgccgagcac 
tgctgtggag 



300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1572 



<210> 170 
<211> 189 
<212> DNA 
<213> E. Coli 

<400> 170 

atgaccatca gcgatatcat tgaaattatt gtcgtttgcg cactgatatt tttcccgctg 60 

ggctatctgg cgcggcactc tttgcgacgc attcgcgaca ccttacgttt gttctttgct 120 

aaacctcgtt atgttaaacc ggccgggacg ttacgccgca cggaaaaagc cagggcaacc 180 

aaaaaatga ]_39 

<210> 171 
<211> 1680 
<212> DNA 
<213> E. Coli 



<400 
atgactcaat 
ctttccggct 
aacttccatc 
tacagcctgc 
catgacacct 
ttcagtaccg 
gccatttttg 
tttgtggttg 
ttgtggccag 
accgttgcgg 
ccaacaacgg 
aaatcgacct 
aacatctgtt 
tggtcgcatt 
gcggcgatcc 
ccggcaaata 



> 171 
ttacgcaaaa 
ggaacttcta 
cgctcctcaa 
atcgcttgcg 
ggttgcctgg 
attatttaat 
ttttattagt 
ccatactgct 
ccggacaacc 
cgacgggtgg 
cgaaccttaa 
tcccgtcttc 
cgctttcctg 
tcgatattga 
gtttactgcg 
acgactgcta 



taccgccatg 
ttttctggtt 
tttggtgttt 
ccactggatt 
cccggaaagc 
cgaccttgtc 
ggcctggtta 
atggctgaac 
gacgaccact 
cgcaccggta 
cgcctggctg 
gctgcccgct 
gtcggatata 
gttcaagaac 
cgccagctgc 
tctgtttgat 



ccttcttccc 
aagttcggcc 
gccgcgtttc 
gccctgccga 
ataatgagcc 
acacgcttta 
ttcctgtcac 
gtacttaccc 
gtaacaacga 
gtgggtgata 
aataatttct 
gatgctcagc 
gaagccgccg 
tttaactccg 
gggcagactt 
aacctttcga 



tctggcaata 
tgttgtgggc 
tgctgatgcc 
tcggctttgc 
agggttcgca 
ttaactggca 
aatggattcg 
tggcgggacc 
cgggtggtaa 
tgcccgcaca 
ataacgcgga 
catttgaact 
ggttgatgtc 
ccacctccta 
cgcacactaa 
aactgggctt 



ctggcgcggc 
gggatatctt 
ccttccgcgc 
tttgttctgg 
ggtggcgggg 
gatgattggg 
cattaccgtt 
aagtttctcc 
cgcagcggca 
aactgcaccg 
ggcgaaacgt 
actggtgatt 
gcatccactg 
cagtggcccg 
tctgtatcaa 
tacccagcac 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
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ctgatgatgg gacataacgg ccagttcggc ggttttttga aagaagttcg cgaaaatggc 1020 

ggcatgcaga gcgaattgat ggatcaaaca aatctgccgg ttattttgct gggctttgat 1080 

ggttcgccgg tttatgacga taccgctgtg cttaaccgct ggctggacgt taccgaaaaa 1140 

gataaaaaca gccgtagtgc cacgttctac aacacgcttc cactgcatga cggcaaccat 1200 

tatccggggg tcagcaaaac agcggattac aaagcgcggg cgcagaaatt ctttgatgaa 1260 

ctggacgcct tctttactga acttgagaaa tcgggtcgta aagtgatggt ggtcgtggtg 1320 

ccggaacacg gcggcgcgct gaagggcgac agaatgcagg tatctggcct acgtgatatc 1380 

cctagcccgt ctatcaccga cgtccccgtt ggggtgaaat tcttcggcat gaaggcaccg 14 4 0 

catcaggggg caccgattgt catcgaacaa ccgagcagct tcctggctat ctccgatctg 1500 

gtggttcgcg ttctcgatgg caagattttc accgaagaca atgttgactg gaaaaaactc 1560 

accagtgggt tgccacaaac agcaccggtc tccgagaact caaatgcagt agttattcaa 1620 

taccaggata aaccgtacgt tcgcctgaac ggcggcgact gggtgcctta cccgcagtaa 1680 

<210> 172 

<211> 384 

<212> DNA 

<213> E. Coll 

<400> 172 

atggaaggtt caagaatgaa ataccgcatc gctttagctg tttctctctt tgctcttagt 60 

gccggtagtt atgccactac cctgtgtcag gaaaaggagc aaaatatcct taaggagatc 120 

agctatgccg aaaaacacca aaaccagaat cgtattgacg gtctgaataa agccctgagt 180 

gaagtccggg ccaactgttc agatagccag ctgcgtgccg atcatcagaa gaaaatcgca 240 

aagcagaaag atgaggtggc ggaacgccag caagatttag ccgaggcgaa gcaaaaaggc 300 

gatgccgata agattgccaa acgcgaacgg aaactggcag aagcgcagga agagctgaaa 360 

aagctggaag cgcgcgacta ctaa 384 

<210> 173 
<211> 306 
<212> DNA 
<213> E. Coll 

<400> 173 

atgtcgaaag aacacactac ggaacatctg cgtgctgagt tgaaatccct ttccgatacg 60 

ctggaagagg tgcttagctc atctggcgag aagtcgaaag aagagttgag taagattcgt 120 

agcaaagcgg agcaggcact gaaacagagc cgttatcgcc tgggtgaaac cggtgatgcc 180 

attgccaaac aaacccgtgt cgcggcggcg cgtgccgatg agtatgtgcg cgaaaatccg 240 

tggacgggcg tgggcattgg cgctgcaatc ggtgtagtgc tcggcgttct gctgtcgcgt 300 

cgttaa 3Q5 

<210> 174 
<211> 405 
<212> DNA 
<213> E. Coli 

<400> 174 

atggcggaca ctcatcacgc acaagggccc ggtaaaagcg ttctgggcat cgggcagcga 60 

attgtttcta tcatggttga aatggtagag acacgtctgc ggctggcggt ggtggagctg 120 

gaagaggaaa aagcgaatct ctttcaactt ttactgatgc tgggcctgac gatgcttttc 180 

gctgcatttg gtcttatgag cctgatggtg ctaattattt gggcggttga cccgcaatat 240 

cgcctgaatg cgatgattgc caccaccgtg gtgttgctgc tactggcact gattggcggt 300 

atctggacgc tacgtaaatc gcgtaagtct acgttgctgc gccatacacg ccatgagtta 360 

gcaaacgatc ggcagctgct cgaggaggag tcccgtgagc agtaa 405 

<210> 175 
<211> 300 
<212> DNA 
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<213> E. Coli 



<400> 175 

gtgagcagta aagtcgaacg tgaacgacgt aaggcgcaac tgcttagcca gatccagcaa 60 

caacggctgg atctttccgc cagtcgtcgt gaatggctgg agacaacagg cgcttacgat 120 

cgtcgctgga atatgctgct aagtctgcgc tcctgggcgc tggttggcag tagcgtgatg 180 

gcgatctgga cgattcgcca tcctaatatg ctggtccgct gggccagacg cggttttggc 240 

gtatggagcg cctggcgtct ggttaaaacg accctcaagc agcaacagct tcgcggttaa 300 

<210> 176 
<211> 483 
<212> DNA 
<213> E. Coli 

<400> 176 

atgattctct ccatcgacag caacgacgct aataccgcgc cattgcacaa aaaaacaatc 60 

agcagcctga gtggcgcagt ggagagtatg atgaaaaaat tagaagatgt tggtgtactg 120 

gtagcgcgca ttttaatgcc gattctgttt attaccgctg gctggggaaa aattactggc 180 

tacgcgggta cccaacaata tatggaagca atgggcgtcc cgggttttat gctgccactg 240 

gtgattctgc ttgagtttgg tggtggtctg gcaatcctgt tcggtttcct gactcgcacc 300 

acagccctgt ttactgcggg ctttacgctg ctgacggcat ttttatttca cagcaacttt 360 

gctgaaggcg tcaactcgct gatgttcatg aaaaacctga caatttctgg cggattcctg 420 

ctgctggcaa ttaccggtcc gggcgcgtat agcatcgacc gcctgctgaa taaaaagtgg 480 

taa 483 

<210> 177 
<211> 891 
<212> DNA 
<213> E. Coli 

<400> 177 

atgatcaaga agacaacgga aattgatgcc atcttgttaa atctcaataa ggctatcgat 60 

gcccactacc agtggctggt gagtatgttt cacagcgtgg tcgcgagaga tgccagtaag 120 

ccagaaataa cggataacca ttcttatgga ctgtgccagt ttggtcggtg gattgatcat 180 

ctggggccac tcgataacga tgaattacct tacgttcggc taatggattc tgcccatcaa 240 

catatgcata actgtggtcg ggaattaatg ctggctattg ttgaaaatca ctggcaggac 300 

gcgcatttcg acgcctttca ggaggggttg ctttctttta ctgcggcatt aaccgattac 360 

aaaatttatt tgctgacgat ccgtagcaat atggatgttt tgacgggatt gccgggtcgt 420 

cgggttcttg atgaatcctt tgatcatcag ttacgcaacg ctgagcctct gaatctttat 480 

ttaatgttgt tggatattga ccgatttaaa ttggttaatg atacctacgg gcatttaatc 540 

ggcgatgtag tattacgcac cctggcaact tacttagcca gttggacgcg tgattacgaa 600 

acggtttatc gctacggggg cgaagaattt atcattattg tcaaagcggc taatgatgaa 660 

gaagcatgtc gtgcaggtgt cagaatttgc cagttagtcg ataaccatgc catcacacat 720 

tctgaagggc atatcaacat taccgtgaca gcaggtgtga gtcgcgcatt tcctgaagag 780 

cctctggatg tggtcattgg aagagcggac cgggcaatgt atgagggtaa gcaaaccgga 84 0 

agaaatcgct gcatgtttat tgacgaacaa aatgtgatta accgagttta a 891 

<210> 178 
<211> 612 
<212> DNA 
<213> E. Coli 

<400> 178 

atgcgccttc gtgttgtgcc cggttttatt tcaccacctc cgggcttcgg tggtctcggc 60 

tataccccta cagcgagagc ttgtgttaac atttcaatac ccttacagtt gagagttatt 120 

gatatgttgg atgtatttac tccattgttg aaactttttg ctaacgagcc actcgaaaga 180 

cttatgtata cgattatcat ttttggtctc actctctggc tgataccgaa agagtttact 240 
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gtcgcattca atgcttatac tgaaatacct tggctctttc agattatcgt ttttgccttt 300 

tctttcgtgg tcgccatttc cttctcaaga ttgcgagcac atattcaaaa gcattattca 360 

ttactaccag agcaacgagt attgcttcgt ttatctgaga aagaaatcgc tgtatttaaa 420 

gatttcctta aaacaggaaa tcttattatc acttctcctt gccgtaaccc ggttatgaaa 480 

aaattagaac ggaagggcat cattcaacat cagagtgata gcgcaaactg ttcttattat 540 

ctcgtcaccg aaaaatactc ccattttatg aagttattct ggaacagcag gagtagacgt 600 

tttaatcgtt ag 522 

<210> 179 
<211> 177 
<212> DNA 
<213> E. Coli 

<400> 179 

gtgcttctcc aaccatcggc gcgcaccagt ttcggtttta aatgttttgc ttttggtata 60 

cgtcatggca gtgaacgttc catcctggtt ggggaacacg ccgcacacca gggattcgtt 120 

gttgccgagg tcgatttttt gcattttgcg aatctcacat cttgttgcta cgtatag 177 

<210> 180 
<211> 4281 
<212> DNA 
<213> E. Coli 

<400> 180 

atgagcggaa aaccagcggc gcgtcaggga gatatgactc agtatggcgg tcccattgtc 60 

cagggttcgg caggtgtaag aattggcgcg cccaccggcg tggcgtgctc ggtgtgtccg 120 

ggcgggatga cttcgggcaa cccggtaaat ccgctgctgg gggcgaaggt gctgcccggc 180 

gagacggacc ttgcgctgcc cggcccgctg ccgttcattc tctcccgcac ctacagcagc 240 

taccggacga agacgcctgc accggtgggc gttttcggcc ccggctggaa agcgccttct 300 

gatatccgct tacagctacg tgatgacgga ctgatactca acgacaacgg cgggcggagc 360 

attcactttg agccgctgct gccgggggag gcggtgtaca gccgcagtga gtcaatgtgg 420 

ctggtgcgcg gtggtaaggc agcacagccg gacggccata cgctggcgcg gctgtggggg 4 80 

gcgctgccgc cggatatccg gttaagcccg catctttacc tggcgaccaa cagcgcacag 540 

gggccgtggt ggatactggg gtggtctgag cgggtgccgg gtgctgagga cgtactgcca 600 

gcgccgctgc cgccgtaccg ggtgcttacc gggatggcgg accgcttcgg gcggacgctg 660 

acgtaccggc gtgaggccgc cggtgacctg gccggggaaa tcaccggcgt gacggacggt 720 

gccgggcggg agttccgtct ggtgctgacc acgcaggcgc agcgtgcgga agaggcccgc 780 

acctcttcgc tatcttcttc tgacagttcc cgccctctct cagcctcagc gttccccgac 840 

acactgcccg gtaccgaata cggccccgac aggggtatcc gcctttcggc ggtgtggctg 900 

atgcacgacc cggcataccc ggagagcctg cccgctgcgc cactggtgcg gtacacgtat 960 

acggaagccg gtgaactgct ggcggtatat gaccgcagca atacgcaggt gcgcgctttc 1020 

acgtatgacg cgcagcaccc gggccggatg gtggcgcacc gttacgcggg aaggccggag 1080 

atgcgctacc gctacgacga tacggggcgg gtggtggagc aactgaaccc ggcagggtta 1140 

agctaccgct atctttatga gcaggaccgc atcaccgtca ccgacagcct gaaccggcgt 1200 

gaggtgctgc atacagaagg cggggccggg ctgaaacggg tggtgaaaaa agaactggcg 1260 

gacggcagcg tcacgcgcag cgggtatgac gcggcaggaa ggctcacggc gcagacggac 1320 

gcggcgggac ggaggacaga gtacggtctg aatgtggtgt ccggcgatat cacggacatc 1380 

accacaccgg acgggcggga gacgaaattt tactataacg acgggaacca gctgacggcg 1440 

gtggtgtccc cggacgggct ggagagccgc cgggaatatg atgaaccggg caggctggta 1500 

tcggagacat cgcgcagcgg ggagacagta cgctaccgct acgatgacgc gcacagtgag 1560 

ttaccggcga cgacaacgga tgcgacgggc agcacccggc agatgacctg gagccgctac 1620 

gggcagttgc tggcgttcac cgactgctcg ggctaccaga cccgttatga atacgaccgc 1680 

ttcggccaga tgacggcggt ccaccgcgag gaaggcatca gcctttaccg ccgctatgac 1740 

aaccgtggcc ggttaacctc ggtgaaagac gcacagggcc gtgaaacgcg gtatgaatac 1800 

aacgccgcag gcgacctgac tgccgttatc accccggacg gcaaccggag cgagacacag 1860 

tacgatgcgt ggggaaaggc ggtcagcacc acgcagggcg ggctgacgcg cagtatggag 1920 

tacgatgctg ccggacgtgt catcagcctg accaacgaga acggcagcca cagcgtcttc 1980 
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agttacgatg 
tatcattatg 
tggtactacg 
cagtggcagt 
cgtgttgccg 
gtggagaacc 
gagcaggggc 
tatggcagcg 
cgggacaggc 
gcatacgaac 
agcctggtat 
ggcccgcgac 
accctcgcac 
ctgccggacc 
gcggaggatg 
gaccgcatcc 
gacagccagc 
agccgctacc 
cgtgacctga 
gacggagaca 
ccgggaagct 
cagcggcgca 
gtgttcccgg 
cgcgtgagca 
gccagacagg 
caccggggac 
tatgatgaat 
cgtctgccag 
tacgatccgt 
aatttatatc 
acttgggatg 
ataataggac 
acgcagaata 
aatggaaaat 
ctgaaaagaa 
gatagtcatg 
agtaaagata 
cttaataata 
tgcataccgt 



cgctggaccg 
acctgaccgg 
atgaatcgga 
atgatggcca 
tccactatgg 
cggagacggg 
tggcaaaccg 
gttacctggc 
tgcaccgtga 
tgaccagcac 
atgaccgtga 
agacgcggga 
cagacctgga 
cggagctgca 
cgcactatgt 
cggcgggtgt 
accgcctggt 
tctacgaccc 
cggggtggat 
ggctgacgac 
tcacgccgct 
gcctggcaga 
ctgaactggt 
gtgaaagccg 
tggagccgga 
tgccgctggc 
ggggcaacca 
ggcagcagca 
tgcaggggcg 
agtatccttt 
atgccagatc 
caagtaaatt 
gatctctatg 
attttgcatc 
aatgtcccac 
gcgattatgt 
ataatcttga 
aaggagaata 
atcatgatta 



gctggtacag 
aaaactcaca 
ccgtatcact 
cggctggctg 
ctatgacgat 
ggaactgctg 
cgtcacgccg 
gggaatgaag 
gacggtgcgc 
atacaccccc 
ctacgggtgg 
atacggctac 
catccgcatc 
cccggacagt 
ctaccgccac 
gatacggacg 
gttctacacg 
gctgggacgg 
gtcgctgtcg 
ggtgcagact 
catccgggtc 
gacgctccag 
gcggctgctg 
ggcgtggctt 
atacacaccg 
gcttatcagc 
gcttaatgag 
tgatgaggaa 
gtatattact 
aaatccacta 
tggagcatgt 
tgatagtact 
taatgatatg 
taaggcagaa 
aggtacagat 
tgatgaattt 
agcattttat 
tatttttatc 
a 



cagggcggct 
cagagtgagg 
caccgcacgg 
acagacatca 
aaaggccgcc 
tggcagcatg 
gacagcctgc 
ctgggcggga 
agcttcggca 
gcaggccagt 
agtgacaacg 
agcgccacgg 
ccgtatgcca 
acactcacag 
gatgaatacg 
gacgacgagc 
cggatacagc 
cgaatggcaa 
cgtaaaccgg 
gacaccacac 
gagacagaga 
caggaaggga 
gacaggctgg 
gcgcagtgcg 
gcgcgaaaag 
gaagacggca 
gagaacccgc 
tcagggctgt 
caggacccga 
caacaaattg 
acggggggag 
gcagatgctg 
gaatactctg 
actgataatt 
agagttgctg 
ttttcaagta 
ctcgcaacac 
agaaatagtg 



ttgacgggcg 
atgagggact 
tgaacggcga 
gccacctgag 
tgaccggcga 
agacgaaaca 
cgccggtgga 
cgccgctggt 
gcatggcagg 
tacagagcca 
gcgacctggt 
gcaggctgga 
cggacccggc 
tgtggccgga 
gcaggctgac 
ggacccacca 
atggcgagcc 
aacgggtctg 
aggtgacgtg 
gtatccagac 
acggcgagcg 
gtgagaacgg 
aggaagaaat 
ggctgacggt 
ctcatcttta 
atacggcgtg 
atcatgtgta 
actataaccg 
tggggttgaa 
accctatggg 
tttgtggtgt 
cgttagatgc 
gtattgtctg 
taagaaagga 
cttatcatac 
gcgataaaaa 
ctgatggacg 
tcccgggatt 



gacgcaacgt 
tgtcatcctc 
accggcagag 
cgaaggccac 
atgccagacg 
cgcatacaac 
gtggctgacg 
cgagtatacg 
cagtaatgcc 
gcacctgaac 
gcgcatcagc 
gagtgtgcgc 
gggcaaccgg 
taaccgcatc 
ggagaagacg 
ctaccactac 
actggtcgag 
gcggcgggag 
gtatggctgg 
ggtatacgag 
ggaaaaagcg 
ccacggcgtg 
ccgggcagac 
ggagcaactg 
tcactgcgac 
gagcgcggaa 
tcagccgtac 
tcaccggtac 
agggggatgg 
attattgcag 
tctttcacgt 
tttgaaagaa 
taaagatact 
gtcatatcct 
tcacggtgca 
tcttgtaaga 
atttgaggcg 
gagttcagta 



2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4281 



<210> 181 
<211> 369 
<212> DNA 
<213> E. Coli 



<400> 181 
atgaaatata gttcaatatt ttcgatgctt 
acagctgttt acggttctga tgaaaacatt 
ttagataaat actctgttaa aaatacggta 
gaaatatatg ttaggtatcg ctatggcgaa 
attacggaac taccagatag ttgggttgtt 
ggtggtgtat ttattataga aattaataag 
agtaaataa 



tcatttttta 


tactatttgc 


ctgtaatgag 


60 


atttttatga 


ggtatgtgga 


aaaattacat 


120 


aaaactgaaa 


caatggcgat 


acaattagct 


180 


cggattgcag 


aagaagaaaa 


accatattta 


240 


gagggagcaa 


agttacctta 


tgaagttgcg 


300 


aaaaatggat 


gtgttttgaa 


tttcctacat 


360 








369 



<210> 182 
<211> 711 
<212> DNA 



"66- 



<213> E. Coli 



<400> 182 

atgctggcgc tgatggatgc ggatggaaac attgcgtgga gcggggagta tgatgagtgg 60 

ggcaaccagc tgaatgaaga gaacccgcat cacctgcacc agccgtaccg gctgccgggg 120 

cagcagtatg ataaggagtc ggggctgtac tacaaccgga accggtacta cgatccgttg 180 

caggggcggt atatcactca ggacccgata gggctggagg ggggatggag tctgtatgcg 24 0 

tatccgctga atccggtgaa tggtattgat ccattagggt taagtcccgc agatgtagcg 300 

ctaataagaa gaaaagatca actaaaccat caaagagcat gggatatatt atctgatact 360 

tatgaagata tgaagagatt aaatttaggt gggactgatc aatttttcca ttgtatggca 420 

ttttgtcgag tgtctaaatt aaatgacgct ggtgttagcc gatcggcgaa agggctgggt 4 80 

tatgaaaaag agattagaga ttacgggtta aatctgttcg gtatgtacgg cagaaaagta 540 

aagctatccc attctgaaat gattgaagat aataaaaaag acttggctgt aaatgaccat 600 

gggttgacat gtccatcaac aacagattgc tcagatagat gtagtgatta tattaatcca 660 

gagcataaaa aaacgataaa ggctttacaa gatgctggct atctcaagta a 711 

<210> 183 

<211> 261 

<212> DNA 

<213> E. Coli 

<400> 183 

atgctggcta tctcaagtaa tctatcaaag atgataatat ttatttttgc tattataatc 60 

attgttgttt tatgcgtaat tacttatctt tatttataca aagatgaatc tcttgtaagt 120 

aaacattaca taaactatat ggcaatacca gaaaatgatg gagtttttac atggctccca 180 

gatttttttc cgcacgtagc ggtggatata tcaatataca caaatgtaga agatgattat 240 

ttttttctta tttttcccta a 261 

<210> 184 
<211> 192 
<212> DNA 
<213> E. Coli 

<400> 184 

gtgagggcca gggaacaagt ggcgaaaatc gtatcaaaga atgatccaga tacaaaaaaa 60 

gtgtggtgta aatatggtaa gataccaggg caaggggatg gtgtaaacct tttttttgtt 120 

ggtgaaatta atgttacgca ttattttata acaaatattg gagctggatt gcctgatgct 180 

tgtgcagagt aa ]_92 

<210> 185 
<211> 504 
<212> DNA 
<213> E. Coli 

<400> 185 

atgccgggca acagcccgca ttatgggcgt tggcctcaac acgattttac gtcacttaaa 60 

aaactcaggc cgcagtcggt aacctcgcgc atacagccgg gcagtgacgt catcgtctgc 120 

gcggaaatgg acgaacagtg gggctatgtc ggggctaaat cgcgccagcg ctggctgttt 180 

tacgcgtatg acagtctccg gaagacggtt gttgcgcacg tattcggtga acgcactatg 240 

gcgacgctgg ggcgtcttat gagcctgctg tcaccctttg acgtggtgat atggatgacg 300 

gatggctggc cgctgtatga atcccgcctg aagggaaagc tgcacgtaat cagcaagcga 360 

tatacgcagc gaattgagcg gcataacctg aatctgaggc agcacctggc acggctggga 420 

cggaagtcgc tgtcgttctc aaaatcggtg gagctgcatg acaaagtcat cgggcattat 480 

ctgaacataa aacactatca ataa 504 

<210> 186 
<211> 276 
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<212> DNA 
<213> E. Coli 



<400> 186 

gtggcttctg tttctatcag ctgtccctcc tgttcagcta ctgacggggt ggtgcgtaac 60 

ggcaaaagca ccgccggaca tcagcgctat ctctgctctc actgccgtaa aacatggcaa 120 

ctgcagttca cttacaccgc ttctcaaccc ggtacgcacc agaaaatcat tgatatggcc 180 

atgaatggcg ttggatgccg ggcaacagcc cgcattatgg gcgttggcct caacacgatt 240 

ttacgtcact taaaaaactc aggccgcagt cggtaa 276 

<210> 187 
<211> 417 
<212> DNA 
<213> E. Coli 



<400 
atgatgacta 
tttgaagcac 
aattcaacta 
cctgattatc 
tttagcctct 
agaaataatg 
agcatttcaa 



> 187 
aaacccaaat 
cgctcaagga 
attgcctgga 
ttactttttt 
cgattccaga 
atgatttcat 
tatttactta 



aaataaatta 
atcatttatt 
gaagttatgt 
aagagcaatg 
accttcagtt 
aaaccctgat 
tgacattaaa 



ataaaaatga 
gaaagtataa 
aatgaagtta 
gatggattcg 
aaaaaccttt 
ctacaagaac 
ggtgatgctg 



tgaatgattt 
tccaaataga 
gtattctttt 
aagttaatgg 
ttgccgtaaa 
ggttagtgat 
ccaacttact 



agactatcca 
atttaattct 
taagaatcaa 
attacgatta 
tgaattttat 
cggggattat 
gatttag 



60 
120 
180 
240 
300 
360 
417 



<210> 188 
<211> 1179 
<212> DNA 
<213> E. Coli 



<400 
atgagtaata 
tgtgggactt 
acattctcac 
acattttgta 
gaacaattat 
tattattata 
aaccaactgg 
atcgcaaata 
ccaaattcaa 
aggctacttg 
gaaaaactgg 
ggcatagcga 
acagaaataa 
gacataatag 
gagtttaata 
tctggaagat 
tataatgttc 
gtattaatcg 
aagggaatat 
tatcctccca 



> 188 
ttgtttacct 
ctgagtctac 
tcttaaataa 
aattaattga 
ttatgggatt 
tacaactaag 
atacagaaac 
ccgagttcag 
aaaaccaaac 
ctgctggtgg 
gtggagatgc 
tagcaacagc 
atagttactt 
aaataatata 
aaactgtaag 
tcaacgccga 
accataaact 
aaaacgaacc 
tagtaggtga 
tgaaaaatat 



gacagtaacg 
aggtaatcgt 
tattaataat 
taaaagcact 
tgacttctat 
aggcgctttt 
aataactatt 
ctatttggca 
aaataatcgt 
cgtatacaat 
aataaaaggc 
atctattctt 
aggcaaactt 
cattaagaga 
aaaaaatttt 
agacctttta 
atctctagat 
atatcataaa 
aagcaaaatc 
tatggaccac 



ggagaacaac 
tggcagagcg 
acggggcttg 
ccattattta 
cgaataaata 
ttatcggcta 
agttatgaat 
ctccctgaaa 
ttcaaaacgt 
gggaacattg 
tatgatcaaa 
ttaacaaagc 
agaggtcaac 
ccttcaaaag 
cttatcaaac 
agaatgagaa 
gatggtggta 
gtttttacta 
actccctggg 
acaaaatga 



aaggaagcat 
ggcatgagga 
gttcacagtt 
ttaattccat 
gatttggtag 
ttcatcacca 
ttatcctctg 
attataaccg 
taaacagcaa 
aaggattcag 
tactaaatga 
gttctaatgt 
aaaaacttct 
acttagctaa 
ttgcaaaaac 
agggcaatgt 
ctaatgattt 
acatgcaatc 
ccattccatc 



ctccgcaggt 
tgaaatattt 
ccatggtata 
taacaataat 
attggaaaag 
gatcattgaa 
tcaacatctt 
tttgttttta 
agctattggc 
agatactgcg 
aaaaacagcg 
tgatacatat 
tgatggtata 
cttacgaaag 
ctccgaagca 
tcctctaaat 
cgaaaattta 
acgaatagct 
tggctcaatt 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1179 



<210> 189 
<211> 666 
<212> DNA 
<213> E. Coli 



<400> 189 
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atggtacttg ctttgaacta 
aaacctcata cgatgccctc 
ggccacgccc ttgattttgg 
tttgatgaag ttacttttct 
ggaattaaaa ctaaaattat 
gctttcgagg atgtcgacaa 
ctctctgccg ttccttgtcg 
ctaaaatcag gaggtgagac 
tacgaaacag gaagaaaaca 
tactatggat tactcgatga 
atattaaagt cgtggagtaa 
atataa 



taatatgcac 
tagatatctt 
ctgcggaaaa 
agactcgaaa 
tgactatgtc 
aataattggg 
ggatacaatc 
tcttattgta 
tctttacgga 
actcgcagtg 
agcaggaagt 



ggagttaata 
tgcgagtata 
cttagatatt 
aggcaacttg 
ccacgatatt 
ggttacgatt 
gacaaaatag 
aatcaatata 
tacatttaca 
caagaaatat 
tcatatgtca 



ttcgctcaga gaatgcagca 
ttagaagcat tgagaaaaat 
ctgatgaatt aatcagtaaa 
aaagagagca aattattaga 
ataaaaatgc aaatacagtt 
tcatcctttg ctctaatgtt 
ttcttagcat caagagatta 
aaagctcata cttcaaaaaa 
aaaattcaaa aagtgtttct 
gttcttcaca tggccttgaa 
ctgttgggag ttgtaatgca 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
666 



<210> 190 
<211> 705 
<212> DNA 
<213> E. Coli 

<400> 190 

gtgaataata tgttcgaacc ccccaaaaat tataatgaaa tgttgcctaa acttcataaa 60 

gcaactttct taaatacgct aatatattgc atacttctag ttatttacga atacatccct 120 

ttaataacat taccaaccaa gtatgtccca cctattaaag atcatgagag ctttattaat 180 

tgggcactat cttttggtat attaccttgt gcttttgcga tttttgcata tttaattagc 240 

ggtgcgttag acctacataa caatgcagcc aaactacttc gggtgcgata tctttgggat 300 

aagcatctaa ttataaaacc gttatcacgg agagctggag tcaacagaaa attaaataaa 360 

gatgaagctc acaatgtaat gagcaatcta tattaccctg aagtaagaaa aattgaagac 420 

aaacattata ttgaactctt ctggaataaa gtatactatt tttggatatt ttttgaattt 480 

tcgataattg cattaatttc cttcctaata atcttttttt gcaaacaaat ggatattttt 540 

catgttgaag gttctttgct gtctttattc ttttttgtaa ttttatcatt ctcagtgagt 600 

ggtattatct ttgctttgac agttaagccc agaactgaaa gtcaagtcgg aaaaatcccg 660 

gacgataaaa taaaagaatt tttcactaaa aataacatta attga 705 

<210> 191 
<211> 285 
<212> DNA 
<213> E. Coli 

<400> 191 

atgtttacta tcaacgcaga agtacgtaaa gagcagggta agggtgcgag ccgccgcctg 60 

cgtgccgcta acaagttccc ggcaatcatc tacggtggca aagaagcgcc gctggctatc 120 

gagctggatc acgacaaagt catgaacatg caagctaaag ctgaattcta cagcgaagtt 180 

Gtgaccatcg ttgttgacgg taaagaaatc aaagttaaag ctcaggacgt acagcgtcac 240 

ccgtacaaac cgaagctgca gcacatcgac ttcgttcgcg cttaa 285 

<210> 192 
<211> 1977 
<212> DNA 
<213> E. Coli 



<400> 192 



atggtattgt tttatcgggc 
atgaatctga cgactctgac 
cgtgaagagg ccatccacgc 
actgagcaat ttctggaaga 
gaagggttgg ctgtgccgca 
gccacactca gcgagccgct 
gtggtgctgc tggcgattcc 



acactggcgc 
ccaccgcgat 
gttgactcaa 
agtgtatcgc 
tggcaaaact 
tcagtgggaa 
ccccaatgaa 



gactataaaa 
gcgttgtgtc 
cgtcttgctg 
cgtgaaagcc 
gctgcggtaa 
ggcgttgatg 
gcgggtacta 



acgatcaagt gaggatcatg 
tgaatgcgcg ctttaccagc 
ctctggggaa aatttccagt 
ttggcccgac cgccttaggt 
aagaagcggc gtttgcggtc 
gcccggaagc agttgattta 
cgcatatgca actgctgaca 



60 
120 
180 
240 
300 
360 
420 
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gcgctgacca cgcgccttgc ggatgatgag attcgggcgc gtatacagtc ggcgacgacg 480 

cctgatgagt tgctctcggc gctggatgac aagggaggca cgcaaccttc tgcctctttt 54 0 

tccaacgcgc caactatcgt ctgcgtaacg gcctgtccgg cgggtattgc tcacacctat 600 

atggctgcgg aatatctgga aaaagccgga cgcaaactcg gcgtaaatgt ttacgttgaa 660 

aaacaaggcg ctaacggcat tgaagggcgt ttaacggcgg atcaactcaa tagtgcaacc 720 

gcctgtattt ttgcggctga agtcgccatc aaggagagtg agcgttttaa tggcattccc 780 

gcgctttcag tgcctgttgc cgagccgatt cgccatgcag aagcgttgat ccaacaagcg 840 

cttaccctca agcgtagcga tgagacgcgt accgtacagc aagatacgca accggtgaaa 900 

agtgtcaaaa cggagctgaa acaggcactg ttgagcggaa tctcttttgc cgtaccgttg 960 

attgtcgcgg ggggcacggt gctggcggtc gcggtattac tgtcgcaaat cttcgggcta 1020 

caagatctgt ttaatgaaga aaactcctgg ctgtggatgt accgcaagct gggcggcggg 1080 

ctgctcggaa ttttgatggt accggtgctc gcggcctata ccgcctattc tctggcagat 1140 

aaaccggcgt tagcgccagg ctttgcggct ggacttgccg ccaacatgat cggctccggg 1200 

tttctcggcg cggtcgttgg cggattgata gccggttact tgatgcgctg ggtgaaaaat 1260 

cacttgcgtc ttagcagtaa attcaatgga ttcctgactt tttatctcta cccggtgctc 1320 

ggtacgttgg gagcgggcag tctgatgctg tttgtggtgg gggaacctgt cgcctggatc 1380 

aataactcgc ttaccgcctg gctgaacggt ctgtcaggaa gtaacgcgct gttgctgggt 1440 

gccattctcg gttttatgtg ttcctttgac cttggagggc cagtgaataa agccgcttat 1500 

gcattctgcc tgggcgcaat ggcgaacggc gtttacggcc cgtatgccat tttcgcctcc 1560 

gtcaaaatgg tttcggcatt taccgtaacc gcttccacga tgctcgcacc gcgcctgttt 1620 

aaagagtttg aaattgagac cgggaaatcc acctggctgt tagggctggc aggtattacc 1680 

gaaggggcga tcccgatggc gattgaagat ccgctgcggg ttattggttc gtttgtgctg 1740 

ggctctatgg taacgggcgc tattgtcggt gcgatgaata tcggcctttc gacacccggt 1800 

gccggcattt tctcgctctt tttacttcat gataatggcg cgggcggtgt tatggcggca 1860 

attggctggt ttggcgcggc attggtgggg gctgcaatct cgactgcaat tctcctgatg 1920 

tggcggcgtc acgcggttaa gcatggcaac tatctgactg atggcgtaat gccataa 1977 

<210> 193 
<211> 2634 
<212> DNA 
<213> E. Coli 

<400> 193 

atgaaagcag tatctcgcgt tcacatcacc ccgcatatgc actgggatcg agagtggtat 60 

ttcaccaccg aagagtcacg tattctgctg gtcaataata tggaagagat cctgtgccga 120 

ctggaacagg acaacgaata caaatattac gtactcgacg ggcaaacggc gatcctcgaa 180 

gattatttcg cggtgaaacc ggaaaacaaa gaccgtgtga agaaacaggt agaagccggc 240 

aagttgatta tcggcccctg gtatacccag accgatacca cgattgtttc tgcggaatcc 300 

atcgtccgta atctgatgta cggaatgcgt gactgcctcg cgtttggcga gccgatgaaa 360 

ataggttatt taccagattc ctttggcatg tccgggcaac tgccgcatat ctacaatgga 420 

tttggcatta cccgcaccat gttctggcgc ggatgttcgg agcgccacgg tactgataaa 4 80 

accgagtttt tgtggcaaag cagtgacggt agcgaagtga cggcgcaggt gctgccgctg 540 

ggctacgcca tcggtaagta cttacctgcc gacgaaaacg gattacgtaa acgcctcgac 600 

agttattttg acgtgctgga aaaagcgtct gtaaccaaag agattttgct gccgaatggg 660 

catgaccaga tgccattgca gcaaaatatc ttcgaagtga tggataagct acgtgagatc 720 

taccctcaac gtaagtttgt gatgagccgc tttgaagagg tatttgagaa gatcgaagcg 780 

cagcgagata atctggcaac cctgaaaggg gaatttattg atggcaaata tatgcgcgtg 840 

catcgcacca tcggttctac gcgtatggat atcaaaattg cccacgcgcg tattgaaaat 900 

aagattgtta atctgctgga accgctggca acactggcct ggacgttggg ttttgaatac 960 

caccacggct tgctggagaa aatgtggaaa gagatcttaa aaaatcatgc ccacgacagt 1020 

atcggctgct gctgtagtga caaagttcat cgcgaaatcg tcgcccgctt cgaactggct 1080 

gaagacatgg cggataatct gattcgtttc tacatgcgca aaattgccga caacatgccg 1140 

cagagcgacg ccgacaaact cgtcctgttt aacctgatgc cctggccgcg tgaagaagtt 1200 

atcaacacca ctgtgcggct gcgcgccagc cagtttaatt tgcgggacga tcgcggtcag 1260 

cctgtaccgt attttattcg ccatgcccgt gagatcgatc caggcctaat cgatcggcaa 1320 

atagttcatt acggtaatta cgatcccttt atggagtttg atatacagat caaccagatt 1380 

gtcccttcta tgggctatcg cacgctttat atcgaagcga atcagcctgg caacgtaatt 1440 
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gcggcaaaaa gtgacgctga agggatactg gaaaatgctt tctggcaaat tgcgctcaat 1500 

gaggatggtt ctctgcaact ggtagataaa gacagcggtg tgcgctatga ccgggtattg 1560 

caaattgaag aaagctctga tgatggtgat gaatatgact attcacccgc aaaagaagag 1620 

tgggtaatta ccgcagcgaa cgcgaaaccg caatgcgata ttattcatga agcctggcag 1680 

agcagggctg ttatccgcta tgacatggca gtgccgctca atttgtcaga acgcagcgcc 1740 

cggcaatcca ctggcagagt aggggtggtg ttggttgtca ctcttagtca taacagcagg 1800 

cgtattgatg tggatatcaa tcttgataac caggctgacg atcatcgcct tcgtgtcctg 18 60 

gtccctacac cttttaacac cgacagtgtt ctggcagata cgcagtttgg ttcgctaacg 1920 

cgccccgtga acgacagtgc aatgaacaac tggcagcaag aaggctggaa agaagcgccg 1980 

gttccggtat ggaatatgct caactatgtt gccttacagg aagggcgtaa cggcatggct 2040 

gtctttagcg aagggttacg tgaatttgaa gtcatcggtg aagagaagaa aacctttgcc 2100 

attacgttgc tgcgtggcgt gggcttactg ggcaaagaag atctgctttt aaggcctggg 2160 

cggccttcgg gaattaaaat gccagtcccg gactcacaac tacgtggtct gctttcttgt 2220 

cgcctaagtt tattgagtta taccggtacg ccaaccgccg ctggtgtagc tcagcaggcg 2280 

cgagcatggc tgactccagt acagtgttac aacaaaatcc catgggatgt gatgaagctc 2340 

aacaaagccg gattcaacgt gccggaaagt tatagtttgt tgaaaatgcc cccagtggga 2400 

tgcctgataa gcgcacttaa gaaagctgaa gaccgacaag aagtgatttt acggctgttt 24 60 

aatccggctg aatcagcaac ctgtgatgcg actgttgctt tcagtcgcga ggtgatttct 2520 

tgctcagaaa cgatgatgga tgaacacatt accaccgagg aaaatcaagg ttcaaatcta 2580 

tcggggcctt ttttacccgg ccagtcacgg acgttcagtt accggcttgc ctga 2634 

<210> 194 
<211> 1572 
<212> DNA 
<213> E. Coli 

<400> 194 

atgatgttag atatagtcga actgtcgcgc ttacagtttg ccttgaccgc gatgtaccac 60 

ttcctttttg tgccactgac gctcggtatg gcgttcctgc tggccattat ggaaacggtc 120 

tacgtcctct ccggcaaaca gatttataaa gatatgacca agttctgggg caagttgttt 180 

ggtatcaact tcgctctggg tgtggctacc ggtctgacca tggagttcca gttcgggact 240 

aactggtctt actattccca ctatgtaggg gatatcttcg gtgcgccgct ggcaatcgaa 300 

ggtctgatgg ccttcttcct cgaatccacc tttgtaggtc tgttcttctt cggttgggat 360 

cgtctgggta aagttcagca tatgtgtgtc acctggctgg tggcgctcgg ttcaaacctg 420 

tccgcactgt ggattctggt tgcgaacggc tggatgcaaa acccaatcgc gtccgatttc 480 

aactttgaaa ctatgcgtat ggagatggtg agcttctccg agctggtgct taacccggtt 540 

gctcaggtga aattcgttca cactgtagcg tctggttatg tgactggcgc gatgttcatc 600 

ctcggtatca gcgcatggta tatgctgaaa ggtcgtgact tcgccttcgc taaacgctcc 660 

tttgctatcg ctgccagctt cggtatggct gctgttctgt ctgttattgt tctgggtgat 720 

gaatccggct acgaaatggg cgacgtgcag aaaaccaaac tggctgctat tgaagccgag 780 

tgggaaacgc aacctgcgcc tgctgccttt actctgttcg gcattcctga tcaggaagag 840 

gagacgaaca aatttgcgat tcagatccct tacgcactgg gcatcattgc aacgcgttcc 900 

gtggataccc cggttatcgg cctgaaagag ctgatggtgc agcatgaaga acgcattcgt 960 

aacgggatga aggcgtactc tctgctcgaa caactgcgtt ctggttctac cgaccaggcg 1020 

gttcgtgacc agttcaatag catgaagaaa gacctcggtt acggtctgct gctgaaacgc 1080 

tatacgccaa acgtggctga tgcgactgaa gcgcagattc aacaggcaac caaagactcc 1140 

atcccgcgtg tagcgccgct gtactttgcg ttccgtatca tggtggcgtg tggcttcctg 1200 

cttctggcaa tcatcgcgct ctctttctgg agtgtcatcc gcaaccgcat tggcgagaaa 1260 

aaatggcttc tgcgcgccgc gctgtacggt attccgctgc cgtggattgc tgtagaagcg 1320 

ggctggttcg tggctgaata tggccgccaa ccgtgggcta tcggtgaagt gctgccgaca 1380 

gctgtggcga actcgtcact gaccgcaggc gatctcatct tctcaatggt gctgatttgc 1440 

ggcctgtata ccctgttcct ggtggcagaa ttgttcttaa tgttcaagtt tgcacgcctc 1500 

ggcccaagca gcctgaaaac cggtcgctat cactttgagc agtcttccac gactactcag 1560 

ccggcacgct aa 1572 

<210> 195 
<211> 1140 
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<212> DNA 
<213> E. Coli 

<400> 195 

atgatcgatt atgaagtatt gcgttttatc tggtggctgc tggttggcgt tctgctgatt 60 

ggttttgcag tcactgacgg tttcgacatg ggggtgggca tgctcacccg tttcctcggt 120 

cgtaacgaca ccgagcgtcg aattatgatt aactccattg caccacactg ggacggtaac 180 

caggtttggc tgatcaccgc gggcggcgca ctctttgctg cctggccgat ggtctatgcc 240 

gctgcgttct ccggcttcta tgtggcgatg atcctcgtgc tggcgtcttt gttcttccgt 300 

ccggtcggtt ttgactaccg ctccaagatt gaagaaaccc gctggcgtaa catgtgggac 360 

tggggcatct tcattggtag cttcgttccg ccgctggtaa ttggtgtagc gttcggtaac 420 

ctgttgcagg gcgtaccgtt caacgttgat gaatatctgc gtctgtacta caccggtaac 480 

ttcttccagt tgcttaaccc gttcggcctg ctggcaggcg tggtgagcgt agggatgatc 540 

attactcagg gcgcaaccta tctgcaaatg cgtaccgtgg gcgaactgca cctgcgtacc 600 

cgtgcaacgg ctcaggtggc tgcgctggtg acactggtct gtttcgcact ggctggcgta 660 

tgggtgatgt acggtatcga tggttatgtc gtgaaatcga caatggacca ttacgcagcc 720 

tctaacccac tgaataaaga agtggttcgt gaagctggcg catggctggt taacttcaac 780 

aacacgccaa ttctgtgggc tattccggca ctgggtgtgg ttctgccgct gctgaccatc 840 

ctgactgcac gtatggataa agccgcgtgg gcgtttgtgt tctcctccct gacgctggcc 900 

tgcatcatcc tgacagccgg tatcgcaatg ttcccgtttg tgatgccgtc cagcaccatg 960 

atgaacgcaa gtctgacaat gtgggatgca acttccagcc agctgacgct taacgtcatg 1020 

acctgggttg cggtggttct ggtaccgatc attctgctct acaccgcctg gtgttactgg 1080 

aaaatgttcg gtcgtatcac caaagaagat attgaacgta acacccactc tctgtactaa 1140 

<210> 196 
<211> 1371 
<212> DNA 
<213> E. Coli 

<400> 196 

atggaattat cctcactgac cgccgtttcc cctgtcgatg gacgctacgg cgataaagtc 60 

agcgcgctgc gcgggatttt cagcgaatat ggtttgctga aattccgtgt acaagttgaa 120 

gtacgttggc tgcaaaaact ggccgcgcac gcagcgatca aggaagttcc tgcttttgct 180 

gccgacgcaa tcggttacct tgatgcaatc gtcgccagtt tcagcgaaga agatgcggcg 240 

cgcatcaaaa ctatcgagcg taccactaac cacgacgtta aagcggttga gtatttcctg 300 

aaagaaaaag tggcggagat cccggaactg cacgcggttt ctgaattcat ccactttgcc 360 

tgtacttcgg aagatatcaa taacctctcc cacgcattaa tgctgaaaac cgcgcgtgat 420 

gaagtgatcc tgccatactg gcgtcaactg attgatggca ttaaagatct cgccgttcag 480 

tatcgcgata tcccgctgct gtctcgtacc cacggtcagc cagccacgcc gtcaaccatc 540 

ggtaaagaga tggcaaacgt cgcctaccgt atggagcgcc agtaccgcca gcttaaccag 600 

gtggagatcc tcggcaaaat caacggcgcg gtcggtaact ataacgccca catcgccgct 660 

tacccggaag ttgactggca tcagttcagc gaagagttcg tcacctcgct gggtattcag 720 

tggaacccgt acaccaccca gatcgaaccg cacgactaca ttgccgaact gtttgattgc 780 

gttgcgcgct tcaacactat tctgatcgac tttgaccgtg acgtctgggg ttatatcgcc 840 

cttaaccact tcaaacagaa aaccattgct ggtgagattg gttcttccac catgccgcat 900 

aaagttaacc cgatcgactt cgaaaactcc gaagggaatc tgggcctttc caacgcggta 960 

ttgcagcatc tggcaagcaa actgccggtt tcccgctggc agcgtgacct gaccgactct 1020 

accgtgctgc gtaacctcgg cgtgggtatc ggttatgcct tgattgcata tcaatccacc 1080 

ctgaaaggcg tgagcaaact ggaagtgaac cgtgaccatc tgctggatga actggatcac 1140 

aactgggaag tgctggctga accaatccag acagttatgc gtcgctatgg catcgaaaaa 1200 

ccgtacgaga agctgaaaga gctgactcgc ggtaagcgcg ttgacgccga aggcatgaag 1260 

cagtttatcg atggtctggc gttgccagaa gaagagaaag cccgcctgaa agcgatgacg 1320 

ccggctaact atattggtcg agctatcacg atggttgatg agctgaaata a 1371 

<210> 197 
<211> 186 
<212> DNA 
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<213> E. Coli 



<400> 197 

atgctgattc tgactcgtcg agttggtgag accctcatga ttggggatga ggtcaccgtg 60 

acagttttag gggtaaaggg caaccaggta cgtattggcg taaatgcccc gaaggaagtt 120 

tctgttcacc gtgaagagat ctaccagcgt atccaggctg aaaaatccca gcagtccagt 180 

tactaa 235 

<210> 198 
<211> 93 
<212> DNA 
<213> E. Coli 



<400> 198 

ggtgaggtgg ccgagaggct gaaggcgctc ccctgctaag ggagtatgcg gtcaaaagct 60 
gcatccgggg ttcgaatccc cgcctcaccg cca 93 

<210> 199 

<211> 603 

<212> DNA 

<213> E. Coli 



<400> 199 

atgaagaata aggctgataa caaaaaaagg aacttcctga cccatagtga aatcgaatca 60 

ctccttaaag cagcaaatac cgggcctcat gcagcacgta attattgtct gactttgctt 120 

tgttttattc atggtttccg ggcgagtgaa atttgtcgat tgaggatttc ggatattgat 180 

cttaaggcaa agtgtatata tatccatcga ttaaaaaaag gcttttcaac aacgcacccg 240 

ctattgaata aagaagttca ggctttaaaa aactggttga gtatccgtac ttcgtacccg 300 

catgctgaga gcgagtgggt atttttatca cgtaagggga atccgctttc tcggcaacag 360 

ttttaccata ttatctcgac ttccggtggt aatgccgggt tgtcactgga gattcatccg 420 

cacatgttac gccattcgtg tggttttgct ttggcgaata tgggaataga tacgcgactt 480 

atccaggatt atcttgggca tcgcaatatt cgtcatactg tctggtatac cgccagcaat 540 

gcagggcgtt tttacggcat ctgggataga gccagaggac gacagcgtca cgctgtttta 600 

tag 603 

<210> 200 
<211> 597 
<212> DNA 
<213> E. Coli 



<400> 200 

gtgagtaaac gtcgttatct taccggtaaa gaagttcagg ccatgatgca ggcggtttgt 60 

tacggggcaa cgggagccag agattattgt cttattctgt tggcatatcg gcatgggatg 120 

cgtattagtg aactgcttga tctgcattat caggaccttg accttaatga aggtagaata 180 

aatattcgcc gactgaagaa cggattttct accgttcacc cgttacgttt tgatgagcgt 240 

gaagccgtgg aacgctggac ccaggaacgt gctaactgga aaggcgctga ccggactgac 300 

gctatattta tttctcgccg cgggagtcgg ctttctcgcc agcaggccta tcgcattatt 360 

cgcgatgccg gtattgaagc tggaaccgta acgcagactc atcctcatat gttaaggcat 420 

gcttgcggtt atgaattggc ggagcgtggt gcagatactc gtttaattca ggattatctc 480 

gggcatcgaa atattcgcca tactgtgcgt tataccgcca gtaatgctgc tcgttttgcc 540 

ggattatggg aaagaaataa tctcataaac gaaaaattaa aaagagaaga ggtttga 597 

<210> 201 
<211> 549 
<212> DNA 
<213> E. Coli 
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<400> 201 

atgaaaatta aaactctggc aatcgttgtt ctgtcggctc tgtccctcag ttctacagcg 60 

gctctggccg ctgccacgac ggttaatggt gggaccgttc actttaaagg ggaagttgtt 120 

aacgccgctt gcgcagttga tgcaggctct gttgatcaaa ccgttcagtt aggacaggtt 180 

cgtaccgcat cgctggcaca ggaaggagca accagttctg ctgtcggttt taacattcag 240 

ctgaatgatt gcgataccaa tgttgcatct aaagccgctg ttgccttttt aggtacggcg 300 

attgatgcgg gtcataccaa cgttctggct ctgcagagtt cagctgcggg tagcgcaaca 360 

aacgttggtg tgcagatcct ggacagaacg ggtgctgcgc tgacgctgga tggtgcgaca 420 

tttagttcag aaacaaccct gaataacgga accaatacca ttccgttcca ggcgcgttat 480 

tttgcaaccg gggccgcaac cccgggtgct gctaatgcgg atgcgacctt caaggttcag 540 

tatcaataa 549 



<210> 202 
<211> 648 
<212> DNA 
<213> E. Coli 

<400> 202 

gtgctgctaa tgcggatgcg accttcaagg ttcagtatca ataacctacc caggttcagg 60 

gacgtcatta cgggcaggga tgcccaccct tgtgcgataa aaataacgat gaaaaggaag 120 

agattatttc tattagcgtc gttgctgcca atgtttgctc tggccggaaa taaatggaat 180 

accacgttgc ccggcggaaa tatgcaattt cagggcgtca ttattgcgga aacttgccgg 240 

attgaagccg gtgataaaca aatgacggtc aatatggggc aaatcagcag taaccggttt 300 

catgcggttg gggaagatag cgcaccggtg ccttttgtta ttcatttacg ggaatgtagc 360 

acggtggtga gtgaacgtgt aggtgtggcg tttcacggtg tcgcggatgg taaaaatccg 420 

gatgtgcttt ccgtgggaga ggggccaggg atagccacca atattggcgt agcgttgttt 4 80 

gatgatgaag gaaacctcgt accgattaat cgtcctccag caaactggaa acggctttat 540 

tcaggctcta cttcgctaca tttcatcgcc aaatatcgtg ctaccgggcg tcgggttact 600 

ggcggcatcg ccaatgccca ggcctggttc tctttaacct atcagtaa 648 

<210> 203 
<211> 726 
<212> DNA 
<213> E. Coli 

<400> 203 

gtgagtaata aaaacgtcaa tgtaaggaaa tcgcaggaaa taacattctg cttgctggca 60 

ggtatcctga tgttcatggc aatgatggtt gccggacgcg ctgaagcggg agtggcctta 120 

ggtgcgactc gcgtaattta tccggcaggg caaaaacaag agcaacttgc cgtgacaaat 180 

aatgatgaaa atagtaccta tttaattcaa tcatgggtgg aaaatgccga tggtgtaaag 240 

gatggtcgtt ttatcgtgac gcctcctctg tttgcgatga agggaaaaaa agagaatacc 300 

ttacgtattc ttgatgcaac aaataaccaa ttgccacagg accgggaaag tttattctgg 360 

atgaacgtta aagcgattcc gtcaatggat aaatcaaaat tgactgagaa tacgctacag 420 

ctcgcaatta tcagccgcat taaactgtac tatcgcccgg ctaaattagc gttgccaccc 480 

gatcaggccg cagaaaaatt aagatttcgt cgtagcgcga attctctgac gctgattaac 540 

ccgacaccct attacctgac ggtaacagag ttgaatgccg gaacccgggt tcttgaaaat 600 

gcattggtgc ctccaatggg cgaaagcacg gttaaattgc cttctgatgc aggaagcaat 660 

attacttacc gaacaataaa tgattatggc gcacttaccc ccaaaatgac gggcgtaatg 720 
gaataa 

<210> 204 
<211> 2637 
<212> DNA 
<213> E. Coli 

<400> 204 

atgtcatatc tgaatttaag actttaccag cgaaacacac aatgcttgca tattcgtaag 60 
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catcgtttgg ctggtttttt tgtccgactc gttgtcgcct gtgcttttgc cgcacaggca 120 

cctttgtcat ctgccgacct ctattttaat ccgcgctttt tagcggatga tccccaggct 180 

gtggccgatt tatcgcgttt tgaaaatggg caagaattac cgccagggac gtatcgcgtc 240 

gatatctatt tgaataatgg ttatatggca acgcgtgatg tcacatttaa tacgggcgac 300 

agtgaacaag ggattgttcc ctgcctgaca cgcgcgcaac tcgccagtat ggggctgaat 360 

acggcttctg tcgccggtat gaatctgctg gcggatgatg cctgtgtgcc attaaccaca 420 

atggtccagg acgctactgc gcatctggat gttggtcagc agcgactgaa cctgacgatc 480 

cctcaggcat ttatgagtaa tcgcgcgcgt ggttatattc ctcctgagtt atgggatccc 540 

ggtattaatg ccggattgct caattataat ttcagcggaa atagtgtaca gaatcggatt 600 

gggggtaaca gccattatgc atatttaaac ctacagagtg ggttaaatat tggtgcgtgg 660 

cgtttacgcg acaataccac ctggagttat aacagtagcg acagatcatc aggtagcaaa 720 

aataaatggc agcatatcaa tacctggctt gagcgagaca taataccgtt acgttcccgg 780 

ctgacgctgg gtgatggtta tactcagggc gatattttcg atggtattaa ctttcgcggc 840 

gcacaattgg cctcagatga caatatgtta cccgatagtc aaagaggatt tgccccggtg 900 

atccacggta ttgctcgtgg tactgcacag gtcactatta aacaaaatgg gtatgacatt 960 

tataatagta cggtgccacc ggggcctttt accatcaacg atatctatgc cgcaggtaat 1020 

agtggtgact tgcaggtaac gatcaaagag gctgacggca gcacgcagat ttttaccgta 1080 

ccctattcgt cagtcccgct tttgcaacgt gaagggcata ctcgttattc cattacggca 114 0 

ggagaatacc gtagtggaaa tgcgcagcag gaaaaaaccc gctttttcca gagtacatta 1200 

ctccacggcc ttccggctgg ctggacaata tatggtggaa cgcaactggc ggatcgttat 12 60 

cgtgctttta atttcggtat cgggaaaaac atgggggcac tgggcgctct gtctgtggat 1320 

atgacgcagg ctaattccac acttcccgat gacagtcagc atgacggaca atcggtgcgt 1380 

tttctctata acaaatcgct caatgaatca ggcacgaata ttcagttagt gggttaccgt 1440 

tattcgacca gcggatattt taatttcgct gatacaacat acagtcgaat gaatggctac 1500 

aacattgaaa cacaggacgg agttattcag gttaagccga aattcaccga ctattacaac 1560 

ctcgcttata acaaacgcgg gaaattacaa ctcaccgtta ctcagcaact cgggcgcaca 1620 

tcaacactgt atttgagtgg tagccatcaa acttattggg gaacgagtaa tgtcgatgag 1680 

caattccagg ctggattaaa tactgcgttc gaagatatca actggacgct cagctatagc 1740 

ctgacgaaaa acgcctggca aaaaggacgg gatcagatgt tagcgcttaa cgtcaatatt 1800 

cctttcagcc actggctgcg ttctgacagt aaatctcagt ggcgacatgc cagtgccagc 18 60 

tacagcatgt cacacgatct caacggtcgg atgaccaatc tggctggtgt atacggtacg 1920 

ttgctggaag acaacaacct cagctatagc gtgcaaaccg gctatgccgg gggaggcgat 1980 

ggaaatagcg gaagtacagg ctacgccacg ctgaattatc gcggtggtta cggcaatgcc 2040 

aatatcggtt acagccatag cgatgatatt aagcagctct attacggagt cagcggtggg 2100 

gtactggctc atgccaatgg cgtaacgctg gggcagccgt taaacgatac ggtggtgctt 2160 

gttaaagcgc ctggcgcaaa agatgcaaaa gtcgaaaacc agacgggggt gcgtaccgac 2220 

tggcgtggtt atgccgtgct gccttatgcc actgaatatc gggaaaatag agtggcgctg 2280 

gataccaata ccctggctga taacgtcgat ttagataacg cggttgctaa cgttgttccc 2340 

actcgtgggg cgatcgtgcg agcagagttt aaagcgcgcg ttgggataaa actgctcatg 24 00 

acgctgaccc acaataataa gccgctgccg tttggggcga tggtgacatc agagagtagc 24 60 

cagagtagcg gcattgttgc ggataatggt caggtttacc tcagcggaat gcctttagcg 2520 

ggaaaagttc aggtgaaatg gggagaagag gaaaatgctc actgtgtcgc caattatcaa 2580 

ctgccaccag agagtcagca gcagttatta acccagctat cagctgaatg tcgttaa 2 637 

<210> 205 
<211> 531 
<212> DNA 
<213> E. Coli 

<400> 205 

atgagaaaca aaccttttta tcttctgtgc gcttttttgt ggctggcggt gagtcacgct 60 

ttggctgcgg atagcacgat tactatccgc ggctatgtca gggataacgg ctgtagtgtg 120 

gccgctgaat caaccaattt tactgttgat ctgatggaaa acgcggcgaa gcaatttaac 180 

aacattggcg cgacgactcc tgttgttcca tttcgtattt tgctgtcacc ctgtggtaat 240 

gccgtttctg ccgtaaaggt tgggtttact ggcgttgcag atagccacaa tgccaacctg 300 

cttgcacttg aaaatacggt gtcagcggct tcgggactgg gaatacagct tctgaatgag 360 

cagcaaaatc aaatacccct taatgctcca tcgtccgcgc tttcgtggac gaccctgacg 420 
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ccgggtaaac caaatacgct gaatttttac gcccggctaa tggcgacaca ggtgcctgtc 
actgcggggc atatcaatgc cacggctacc ttcactcttg aatatcagta a 



480 
531 



<210> 206 
<211> 504 
<212> DNA 
<213> E. Coli 

<400> 206 

atgaaatggt gcaaacgtgg gtatgtattg gcggcaatat tggcgctcgc aagtgcgacg 60 

atacaggcag ccgatgtcac catcacggtg aacggtaagg tcgtcgccaa accgtgtacg 120 

gtttccacca ccaatgccac ggttgatctc ggcgatcttt attctttcag tcttatgtct 180 

gccggggcgg catcggcctg gcatgatgtt gcgcttgagt tgactaattg tccggtggga 240 

acgtcgaggg tcactgccag cttcagcggg gcagccgaca gtaccggata ttataaaaac 300 

caggggaccg cgcaaaacat ccagttagag ctacaggatg acagtggcaa cacattgaat 360 

actggcgcaa ccaaaacagt tcaggtggat gattcctcac aatcagcgca cttcccgtta 420 

caggtcagag cattgacagt aaatggcgga gccactcagg gaaccattca ggcagtgatt 480 

agcatcacct atacctacag ctga 504 

<210> 207 
<211> 903 
<212> DNA 
<213> E. Coli 

<400> 207 

atgaaacgag ttattaccct gtttgctgta ctgctgatgg gctggtcggt aaatgcctgg 60 

tcattcgcct gtaaaaccgc caatggtacc gctatcccta ttggcggtgg cagcgccaat 120 

gtttatgtaa accttgcgcc cgtcgtgaat gtggggcaaa acctggtcgt ggatctttcg 180 

acgcaaatct tttgccataa cgattatccg gaaaccatta cagactatgt cacactgcaa 240 

cgaggctcgg cttatggcgg cgtgttatct aatttttccg ggaccgtaaa atatagtggc 300 

agtagctatc catttcctac caccagcgaa acgccgcgcg ttgtttataa ttcgagaacg 360 

gataagccgt ggccggtggc gctttatttg acgcctgtga gcagtgcggg cggggtggcg 420 

attaaagctg gctcattaat tgccgtgctt attttgcgac agaccaacaa ctataacagc 480 

gatgatttcc agtttgtgtg gaatatttac gccaataatg atgtggtggt gcctactggc 540 

ggctgcgatg tttctgctcg tgatgtcacc gttactctgc cggactaccc tggttcagtg 600 

ccaattcctc ttaccgttta ttgtgcgaaa agccaaaacc tggggtatta cctctccggc 660 

acaaccgcag atgcgggcaa ctcgattttc accaataccg cgtcgttttc acctgcacag 720 

ggcgtcggcg tacagttgac gcgcaacggt acgattattc cagcgaataa cacggtatcg 780 

ttaggagcag tagggacttc ggcggtgagt ctgggattaa cggcaaatta tgcacgtacc 840 

ggagggcagg tgactgcagg gaatgtgcaa tcgattattg gcgtgacttt tgtttatcaa 900 

taa 9Q3 

<210> 208 
<211> 1631 
<212> DNA 
<213> E. Coli 

<400> 208 

gtgctgtcaa aactaccccg tagactccga tcttttcaaa catattgcac catccgtgta 60 

catcggggtg aggatatgaa atcaatggat aagttaacaa caggtgttgc ctatggcaca 120 

tcggcgggta atgctggttt ctgggcattg cagttactcg ataaagtaac tccgtcacag 180 

tgggctgcaa tcggtgtgct gggtagcctg gtttttggcc tgctgacgta tctgacaaat 240 

ctttatttca agattaaaga agacaggcgt aaggctgcga gaggagagta atccaatgac 300 

tcaagactat gaactggttg tgaaaggagt ccgtaatttt gagaataaag ttacggtaac 360 

tgtagcctta caggacaaag aacgctttga cggtgaaatt tttgacctgg atgtcgccat 420 

ggaccgtgtt gaaggagctg cgctggagtt ttatgaggca gcagccagaa ggagcgtccg 480 

gcaagtcttc ctggaagtag cagaaaaatt gtcagaaaaa gttgagtctt atctgcagca 540 
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tcagtactcc 
tctatgaaca 
ggcgcatctg 
atggcatacc 
ggaaaaaccg 
attgagcgtg 
ccacaaaaag 
ccgtcgacgt 
cgctggtgga 
caggttattc 
agatattcat 
accgaggatg 
atgctcaatt 
ttaaagatgc 
gtgttaacca 
ataccgcaga 
tcaccaacat 
aacgctacat 
atcaaatgta 



tttaagattg 
caaaaatcag 
ctcctcagat 
gcgatggttc 
tttttcccaa 
ataaggcgct 
cgggtatcgc 
tttataagcg 
ttaaggatgg 
gtcgtgacca 
ggtgattttt 
gcaaaaaaaa 
tgctgctcga 
gcaacagaaa 
gttgcgtgcc 
tcttgccgct 
gctcggagat 
cgcaggagtg 

g 



aaaatcctgc 
atacggcctg 
acttgaccag 
tggcatatgg 
tatgaaactg 
ggcatgggtg 
gtcattttgt 
gctgaatgct 
cggacgcgat 
ggagagcgca 
ctcgtgttgt 
tgggcggaac 
ataattgaac 
tctgctgaaa 
gaagcaaaaa 
gccgtcagag 
attgcagcag 
acttgtcaac 



caataagcac 
tcggctgccg 
tttctggacg 
accatctgtc 
tcgaaggaaa 
gagcgcaata 
ccctataaca 
ggtgatcgta 
tgccgcattc 
ttaacctgct 
caggatttat 
gtgatgctgc 
aggggcgaac 
tttctgccag 
aatatgccat 
gcaaaacaac 
aagctcagct 
agatctatga 
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1380 


acgccttgac 


gcagcgaagc 


1440 


caaaaccgcc 


gaaggaatgc 


1500 


ttatgctgaa 


attgctgacg 


1560 


atctttaaga 


gataaaaagc 


1620 






1631 



<210> 209 
<211> 534 
<212> DNA 
<213> E. Coli 



<400 
atgaacacaa 
gcatctgctc 
gcataccgcg 
aaaaccgttt 
gagcgtgata 
caaaaagcgg 
tcgacgtttt 
tggtggatta 
gttattcgtc 



> 209 
aaatcagata 
ctcagatact 
atggttctgg 
ttcccaatat 
aggcgctggc 
gtatcgcgtc 
ataagcggct 
aggatggcgg 
gtgaccagga 



cggcctgtcg 
tgaccagttt 
catatggacc 
gaaactgtcg 
atgggtggag 
attttgtccc 
gaatgctggt 
acgcgattgc 
gagcgcatta 



gctgccgttc 
ctggacgaaa 
atctgtcggg 
aaggaaaaat 
cgcaatatta 
tataacattg 
gatcgtaaag 
cgcattcgtt 
acctgctggg 



tggcgctgat 


tggtgctggc 


60 


aagaaggtaa 


ccacacaatg 


120 


gtgccacagt 


ggtggatgga 


180 


gcgaccaggt 


caacgccatt 


240 


aagtaccact 


gaccgaacca 


300 


gccccggtaa 


gtgtttcccg 


360 


gtgcatgcga 


agcgattcgc 


420 


caaataactg 


ttacggtcag 


480 


ggatagaaca 


gtga 


534 



<210> 210 
<211> 312 
<212> DNA 
<213> E. Coli 



<400> 210 

atgactcaag actatgaact ggttgtgaaa ggagtccgta 
gtaactgtag ccttacagga caaagaacgc tttgacggtg 
gccatggacc gtgttgaagg agctgcgctg gagttttatg 
gtccggcaag tcttcctgga agtagcagaa aaattgtcag 
cagcatcagt actcctttaa gattgaaaat cctgccaata 
aaatatctat ga 



attt tgagaa 


taaagttacg 


60 


aaatttttga 


cctggatgtc 


120 


aggcagcagc 


cagaaggagc 


180 


aaaaagttga 


gtcttatctg 


240 


agcacgagcg 


tcctcatcat 


300 






312 



<210> 211 
<211> 291 
<212> DNA 
<213> E. Coli 



<400> 211 

gtgctgtcaa aactaccccg tagactccga tcttttcaaa catattgcac catccgtgta 60 
catcggggtg aggatatgaa atcaatggat aagttaacaa caggtgttgc ctatggcaca 120 
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tcggcgggta atgctggttt ctgggcattg cagttactcg ataaagtaac tccgtcacag 
tgggctgcaa tcggtgtgct gggtagcctg gtttttggcc tgctgacgta tctgacaaat 
ctttatttca agattaaaga agacaggcgt aaggctgcga gaggagagta a 



180 
240 
291 



<210> 212 
<211> 216 
<212> DNA 
<213> E. Coli 

<400> 212 

atgtcaaata aaatgactgg tttagtaaaa tggtttaacg ctgataaagg tttcggcttt 60 

atttctcctg ttgatggtag taaagatgtg tttgtgcatt tttctgcgat tcagaatgat 120 

aattatcgaa ccttatttga aggtcaaaag gttaccttct ctatagagag tggtgctaaa 180 

ggtcctgcag cagcaaatgt catcattact gattaa 216 

<210> 213 

<211> 1017 

<212> DNA 

<213> E. Coli 



<210> 214 
<211> 474 
<212> DNA 
<213> E. Coli 

<400> 214 

atggtatata taataatcgt ttcccacgga catgaagact acatcaaaaa attactcgaa 60 

aatcttaatg ctgacgatga gcactacaag attatcgtac gcgacaacaa agactctcta 120 

ttattgaaac aaatatgcca gcattatgca ggcctggact atattagtgg aggtgtatac 180 

ggctttggtc ataataataa tattgcggtg gcgtatgtaa aggaaaaata tagacccgca 240 

gatgatgatt acattttgtt tttgaatccc gatatcatca tgaagcatga tgatttgctg 300 

acatatatta aatatgtcga aagtaagcgt tatgctttta gtacattatg cctgttccga 360 

gatgaagcga aatctttaca tgattattcc gtaagaaaat ttcctgtgct ttctgatttt 420 

attgtgtcat ttatgttagg gattaaggaa ggtgcgaaca agtccctgat atga 474 

<210> 215 
<211> 1119 
<212> DNA 



<400> 213 



atgtttgtca 
gacagtgaat 
gagcagattc 
ggtaatggcc 
tggtacaacc 
ctgtttgccc 
cgccacctgc 
ctggccgaag 
gcacccagct 
aaaggcaatc 
ctgacccaca 
ctgctgcatg 
cgcgaggagc 
agaaccttga 
gccagcatcc 
gtgaaagcca 
ctggccaacc 



tctggagcca tagaacaggg ttcatcatga gtcatcaact taccttcgcc 
tcagcagtaa gcgccgtcag accagaaaag agattttctt gtcccgcatg 
tgccatggca aaacatggtg gaagtcatcg agccgtttta ccccaaggct 
ggcgacctta tccgctggaa accatgctac gcattcactg catgcagcat 
tgagcgatgg cgcgatggaa gatgctctgt acgaaatcgc ctccatgcgt 
ggttatccct ggatagcgcc ttgccggacc gcaccaccat catgaatttc 
tggagcagca tcaactggcc cgccaattgt tcaagaccat caatcgctgg 
caggcgtcat gatgactcaa ggcaccttgg tcgatgccac catcattgag 
cgaccaagaa caaagagcag caacgcgatc cggagatgca tcagaccaag 
agtggcactt tggcatgaag gcccacattg gtgtcgatgc caagagtggc 
gcctggtcac caccgcggcc aacgagcatg acctcaatca gctgggtaat 
gagaggagca atttgtctca gccgatgccg gctaccaagg ggcgccacag 
tggccgaggt ggatgtggac tggctgatcg ccgagcgccc cggcaaggta 
aacagcatcc acgcaagaac aaaacggcca tcaacatcga atacatgaaa 
gggccagggt ggagcaccca tttcgcatca tcaagcgaca gttcggcttc 
gatacaaggg gttgctgaaa aacgataacc aactggcgat gttattcacg 
tgtttcgggc ggaccaaatg atacgtcagt gggagagatc tcactaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1017 
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<213> E. Coli 



<400> 215 



atgggaaaaa 
attttgaaaa 
gtccattctg 
gaggttaaag 
tcaaaagagc 
actaaaaaaa 
cgtgaaattc 
aaaataaaca 
tttatcaaga 
gataaaagcc 
atattttacc 
aggaaattga 
gcgtatgcaa 
tacttggata 
tctaggttag 
ttagcatcag 
tttgattcta 
ctcaaaaaag 
tttgatgaac 



gcatagtcgt 
aatttttggc 
ctaaagagtt 
ggtcgtggct 
tgaatgctac 
gatatgtgta 
ttatggagcc 
ttaaaaaaaa 
aatattctat 
aacttactga 
ctgctgttcc 
aagaacaatc 
aatatattat 
aagaaaaaat 
aaacatgggg 
atttcccatt 
ataacgatga 
atatctctga 
tagttaattt 



tgtttctgcg gtcaatttta ccactggcgg tccatttacc 
agcaactaat aataaagaaa atgtcagttt tatcgcatta 
aaaagaaagt tatccatggg ttaaattcat tgagtttcct 
aaaacgtttg cactttgaat atgtagtttg taaaaaactt 
gcattggatt tgtctgcatg atattacggc caatgtcgtc 
ttgtcataac cctgcacctt tttataaagg aattttattc 
tagctttttc ttatttaaaa tgctatacgg gctgatatat 
tactgcagtg tttgttcaac aattctggat gaaagaaaaa 
aaataacatc attgtcagtc ggccagaaat taaattatct 
tgatgattct caatttaaga ataacccttc tgagttgaca 
acgagtattt aaaaattacg agcttattat tagtgcagca 
caatattaaa tttctgctta ctatcagtgg tacagaaaat 
cagtcttgca gaaggactgg ataatgttca tttcctcggg 
cgatcattgt tataatattt cagatatagt ttgttttccc 
attgccgttg tctgaagcta aagagcgagg taagtgggta 
tactagagaa actcttggta gttatgaaaa gaaagctttt 
catgttagtt aaacttatta ttgacttcaa aaaaggtaac 
tgcaaatttc atttatcgta atgaaaatgt attagttggg 
tattactgaa gaacattga 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1119 



<210> 216 
<211> 591 
<212> DNA 
<213> E. Coli 

<400> 216 

atgatcttaa aactcgctaa acgatatggt ctctgtggtt ttattcggct tgttagagat 60 

gtcttattga ctcgtgtatt ttaccggaac tgtagaatta ttcgatttcc ctgctatatt 120 

cgcaatgatg gtagcattaa ttttggtgaa aatttcacaa gtggagtcgg tctcaggctg 180 

gatgcatttg gacgtggcgt gatttttttt tccgataatg tgcaagttaa cgactatgtt 240 

catatcgcct caattgagag cgttacgata ggtcgggata cgcttattgc aagtaaagta 300 

tttattaccg atcataatca cggttccttt aagcactctg atccaatgag ttcgccaaat 360 

atacctccag acatgcgcac gttggaatct tcagctgttg taattggcca gagggtttgg 420 

ttgggtgaga atgtgacggt tttgcctgga acaattattg gtaatggagt cgtagtcggc 480 

gccaattctg ttgttagagg ttctattccc gaaaatactg tcattgcggg agtaccagca 540 

aaaatcataa agaaatacaa tcatgagacc aaattatggg aaaaagcata g 591 

<210> 217 
<211> 993 
<212> DNA 
<213> Coli 

<400> 217 

atgtattttt tgaatgattt aaatttctct agacgcgatg ctggatttaa agcaagaaaa 60 

gatgcactgg acattgcttc agattatgaa aacatttctg ttgttaacat tcctctatgg 120 

ggtggagtag tccagagaat tattagttct gttaagctta gtacatttct ctgcggtctt 180 

gaaaataaag atgttttaat tttcaatttc ccgatggcca aaccattttg gcatatattg 240 

tcattctttc accgccttct aaaatttaga atagtacctc tgattcatga tattgatgaa 300 

ttaagaggag gagggggtag tgattctgtg cggcttgcta cctgtgatat ggtcataagt 360 

cacaatccac aaatgacaaa gtaccttagt aaatatatgt ctcaggataa aatcaaagac 420 

ataaaaatat ttgattacct cgtctcatct gatgtggagc atcgagatgt tacggataag 480 

caacgagggg tcatatatgc tggcaacctt tctaggcata aatgttcttt catatatact 540 

gaaggatgcg attttactct ctttggtgtc aactatgaaa ataaagataa tcctaaatat 600 

cttggaagtt ttgatgctca atctccggaa aagattaacc tcccaggcat gcaatttgga 660 
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ctcatttggg atggagattc tgtcgaaacc tgtagtggtg cctttggcga ctatttaaag 
tttaataacc ctcataagac atctctttat ctttcaatgg aacttccagt atttatatgg 
gataaagccg cccttgcgga tttcattgta gataatagaa taggatatgc agtgggatca 
atcaaagaaa tgcaagagat tgttgactcc atgacaatag aaacttataa gcaaattagt 
gagaatacaa aaattatttc tcagaaaatt cgaacaggaa gttacttcag ggatgttctt 
gaagaggtga tcgatgatct taaaactcgc taa 



720 
780 
840 
900 
960 
993 



<210> 218 
<211> 1167 
<212> DNA 
<213> E. Coli 

<400> 218 

atgatctatc ttgtaattag tgtctttctc attacagcat ttatctgttt atatcttaag 60 

aaggatatat tttatccagc cgtatgcgtt aatatcatct tcgcactggt cttattggga 120 

tatgaaataa cgtcagatat atatgctttt cagttaaatg acgctacgtt gatttttcta 180 

ctttgcaatg ttttgacatt taccctgtca tgtttattga cggaaagtgt attagatcta 240 

aatatcagaa aagtcaataa tgctatttat agcataccat cgaagaaagt gcataatgta 300 

ggcttgttag ttatttcttt ttcgatgata tatatatgca tgaggttaag taactaccag 360 

ttcgggacta gcttacttag ctatatgaat ttgataagag atgctgatgt tgaagacaca 420 

tcaagaaatt tctcagcata catgcagcca atcattctaa ctacttttgc tttatttatt 480 

tggtctaaaa aatttactaa tacaaaggta agtaaaacat ttactttact tgtttttatt 540 

gtattcatct ttgcaattat actgaatact ggtaagcaaa ttgtctttat ggttatcatc 600 

tcttatgcat tcatcgtagg tgttaataga gtaaaacatt atgtttatct tattacagct 660 

gtaggtgttc tattctcctt gtatatgctc tttttacgtg gactgcctgg ggggatggca 720 

tattatctat ccatgtattt ggtcagccct ataatcgcgt ttcaggagtt ttattttcag 780 

caagtatcta actctgccag ttctcatgtc ttttggtttt ttgaaaggct gatggggcta 840 

ttaacaggtg gagtctctat gtcgttgcat aaagaatttg tgtgggtggg tttgccaaca 900 

aatgtttata ctgctttttc ggattatgtt tatatttccg cggagctaag ctatttgatg 960 

atggttattc atggctgtat ttcaggtgtt ttatggagat tgtctcgaaa ttacatatct 1020 

gtgaaaatat tttattcata ttttatttat accttttctt tcatttttta tcatgaaagc 1080 

ttcatgacta atattagcag ttggatacaa ataactcttt gtatcatagt attctctcaa 1140 

tttcttaagg cccagaaaat aaagtga 1167 

<210> 219 
<211> 1104 
<212> DNA 
<213> E. Coli 



<400> 219 



atgtacgatt 
aaaaagctaa 
tacacagagg 
aatgataaat 
tctccactgg 
caccaaatgt 
aaaaagtatg 
gaggacttat 
aaagaattgc 
tatttttccg 
atgcttgaag 
gcgagtaaag 
tttggagcgt 
ttccaaggga 
gagcataaac 
ttagagtgga 
ctttttaaga 



atatcattgt tggttctggt ttgtttggtg ccgtttgtgc gaatgagtta 
acaaaaaagt tttagtgatt gagaaaagaa atcatatcgg tggaaatgcg 
actgtgaggg tatccagatt cataaatatg gtgcacatat ttttcatacc 
atatatggga ttacgttaat gatttagtag aatttaatcg ttttactaat 
cgatttataa agacaaatta ttcaaccttc cttttaatat gaatactttc 
ggggagttaa agatcctcaa gaagctcaaa atatcattaa tgctcagaaa 
gtgacaaggt acctgaaaat ttggaggagc aggcgatttc attagttggg 
accaagcatt gataaagggt tatacggaga agcagtgggg aagaagtgca 
ctgcatttat tattaagcga atcccagtga gatttacgtt tgataacaat 
atcgctatca aggtattccg gtgggaggct acactaagct tattgaaaaa 
gtgtggacgt aaaattaggc attgattttt tgaaagacaa agattctcta 
cccatagaat catctacact ggacccattg atcagtactt cgactatagg 
tagaatatcg ctctttaaaa tttgagacgg aacgccatga atttccaaac 
atgcagtaat aaatttcact gatgctaatg taccatatac cagaataatt 
attttgacta tgttgagaca aagcatacgg ttgttacaaa agaatatcca 
aagttggcga cgaaccctac tatccagtta atgataataa aaacatggag 
aatatagaga gttagctagc agagaagaca aggttatatt tggcgggcgt 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 



-80- 



ttggccgagt ataaatatta tgatatgcat caagtgatat ctgccgctct ttatcaagtg 
aaaaatataa tgagtacgga ttaa 



1080 
1104 



<210> 220 
<211> 1116 
<212> DNA 
<213> E. Coli 



<400> 220 

atgttcccaa aaataatgaa tgatgaaaac tttttcaaaa aagcggcggc gcacggggag 60 

gaacctcctt taactcctca aaacgaacat cagcggtccg ggctgcgctt cgcccgtcgc 120 

gtcagactac cccgtgcggt tggcctggct ggcatgttct taccgattgc ttcaacgctg 180 

gtttcacacc cgccgccggg ctggtggtgg ctggtgttgg tcggctgggc gttcgtctgg 240 

ccgcatttag cctggcagat agcgagcagg gccgtcgatc cgcttagccg ggaaatttac 300 

aacttaaaaa ccgatgcagt attagcggga atgtgggtag gcgtaatggg cgtaaacgtg 360 

ctgccttcca ccgcgatgtt gatgattatg tgtctgaatt tgatgggggc aggcggcccc 420 

cgtctgtttg tcgcgggtct ggtgttgatg gtggtttcct gccttgtcac cctcgagctg 4 80 

acgggcatta ccgtgtcgtt caatagtgcg ccgctggaat ggtggctctc ccttcccatt 540 

attgtcattt atcctctgct gtttggctgg gtcagctacc agacggcaac caaactggcg 600 

gaacataaac gcaggttgca ggtcatgagt acccgcgacg gcatgacggg cgtgtataac 660 

cgacgtcatt gggaaactat gttacgcaat gaatttgata actgtcggcg gcataatcgc 720 

gatgcaacgt tactgattat cgatatcgac catttcaaga gcatcaacga tacctggggc 780 

catgatgtgg gcgatgaagc gattgtggcg cttacccgac agttacaaat taccctgcgc 840 

ggtagcgatg tgattggtcg gtttggcggc gatgagtttg cagtaatcat gtccggtacg 900 

ccagctgaga gcgccattac cgccatgtta cgggtgcatg aagggctaaa tacattacgt 960 

ttgccgaata cgccacaggt aactttacgg attagtgtgg gggttgcgcc gctgaaccca 1020 

caaatgagtc actatcgtga gtggttgaaa tcggcagatt tggcgcttta caaagcaaag 1080 

aaagccggac gtaaccgcac cgaagtggcg gcctga 1116 

<210> 221 
<211> 1404 
<212> DNA 
<213> E. Coli 



<400> 221 

ttggatgtga acgttgatca gttcgatact gaagctttcc gtactgacaa actggaactg 60 

accagcggca acatcgctga ccataacggt aacgtagtat ctggtgtgtt cgatatccat 120 

agcagcgatt acgttctgaa cgctgatctg gtgaacgacc gtacctggga tacttccaag 180 

tctaactacg gttacggtat tgttgctatg aactctgatg gtcacctgac tatcaacggt 240 

aacggcgacg tagacaacgg tactgaactg gataacagct ctgtagacaa tgttgttgct 300 

gcaaccggta actacaaagt tcgtatcgac aacgcaactg gcgctggcgc tatcgctgat 360 

tacaaagata aagaaattat ctacgtaaac gacgtcaaca gcaacgcgac cttctctgct 420 

gctaacaaag ctgacctggg tgcatacacc tatcaggctg aacagcgcgg taacaccgtt 480 

gttctgcaac agatggagct gaccgactac gctaacatgg cgctgagcat cccgtctgcg 540 

aacaccaata tctggaacct ggaacaagac accgttggta ctcgtctgac caactctcgt 600 

catggcctgg ctgataacgg cggcgcatgg gtaagctact tcggtggtaa cttcaacggc 660 

gacaacggca ccatcaacta tgatcaggat gttaacggca tcatggtcgg tgttgatacc 720 

aaaattgacg gtaacaacgc taagtggatc gtcggtgcgg ctgcaggctt cgctaaaggt 780 

gacatgaatg accgttctgg tcaggtggat caagacagcc agactgccta catctactct 840 

tctgctcact tcgcgaacaa cgtctttgtt gatggtagct tgagctactc tcacttcaac 900 

aacgacctgt ctgcaaccat gagcaacggt acttacgttg acggtagcac caactccgac 960 

gcttggggct tcggtttgaa agccggttac gacttcaaac tgggtgatgc tggttacgtg 1020 

actccttacg gcagcgtttc tggtctgttc cagtctggtg atgactacca gctgagcaac 1080 

gacatgaaag ttgacggtca gtcttacgac agcatgcgtt atgaactggg tgtagatgca 1140 

ggttatacct tcacctacag cgaagatcag gctctgactc cgtacttcaa actggcttac 1200 

gtctacgacg actctaacaa cgataacgat gtgaacggcg attccatcga taacggtact 1260 

gaagggtctg cggtacgtgt tggtctgggt actcagttta gcttcaccaa gaacttcagc 1320 
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gcctataccg atgctaacta cctcggtggt ggtgacgtag atcaagactg gtccgcgaac 1380 
gtgggtgtta aatatacctg gtaa 1404 

<210> 222 
<211> 669 
<212> DNA 
<213> E. Coll 

<400> 222 

atgcccgtca aggatttgac gggcattact gcaaaggacg cgcaaatgtt atctgtagtt 60 

aaacctcttc aggaatttgg taagctcgat aaatgtttgt ccagatacgg tacgcgcttc 120 

gagtttaata atgaaaagca agttatattt tccagtgatg tcaataacga agatactttc 180 

gttattttag agggagttat ctctctgcgt agagaagaaa acgtacttat cggtattacc 240 

caggctcctt atattatggg gctggctgat ggtttaatga aaaacgatat accatacaaa 300 

ttaatatcag aaggaaattg tacgggatat catctaccag ccaaacaaac cattacgctt 360 

attgaacaaa atcaactctg gcgagacgct ttttactggt tagcctggca aaatagaatt 420 

ctggaattac gcgacgtgca gctcattggg cataattcct acgaacaaat ccgcgcaaca 480 

ttattatcaa tgattgactg gaatgaagaa ttgcgatcac gtattggtgt gatgaattat 540 

atccatcaac gtacacgcat atcgcgttct gtcgtcgcag aagttctcgc tgctttgcgt 600 

aaaggcggct atatcgaaat gaataaaggc aaactggtcg ctatcaaccg tttgccttca 660 

gagtattaa 559 

<210> 223 
<211> 255 
<212> DNA 
<213> E. Coll 

<400> 223 

atgaccgata aaatccgtac tctgcaaggt cgcgttgtta gcgacaaaat ggagaaatcc 60 

attgttgttg ctatcgaacg ttttgtgaaa cacccgatct acggtaaatt catcaagcgt 120 

acgaccaaac tgcacgtaca tgacgagaac aacgaatgcg gtatcggtga cgtggttgaa 180 

atccgcgaat gccgtccgct gtccaagact aaatcctgga cgctggttcg cgttgtagag 240 

aaagcggttc tgtaa 255 

<210> 224 
<211> 192 
<212> DNA 
<213> E. Coll 

<400> 224 

atgaaagcaa aagagctgcg tgagaagagc gttgaagagc tgaacaccga gctgctgaac 60 

ctgctgcgtg agcagttcaa cctgcgtatg caggctgcaa gtggccagct gcaacagtct 120 

cacctgttga agcaagtgcg tcgcgatgtc gcacgcgtta agactttact gaacgagaag 180 

gcgggtgcgt aa ]_92 

<210> 225 

<211> 411 

<212> DNA 

<213> E. Coll 



<400 
atgttacaac 
gcgcagggta 
ctgactgccc 
ggtaagatct 
cgtatgggta 
gtcctgtatg 



> 225 
caaagcgtac 
cggatgttag 
gtcagatcga 
ggatccgtgt 
aaggtaaagg 
aaatggacgg 



aaaattccgt 
cttcggcagc 
agcagcacgt 
gttcccggac 
taacgtggag 
tgttccggaa 



aaaatgcaca 
ttcggtctga 
cgtgctatga 
aaaccgatca 
tattgggttg 
gagctggccc 



aaggccgtaa 
aagctgttgg 
cccgtgcagt 
ctgaaaagcc 
ccttgattca 
gtgaagcatt 



ccgcggtctg 
ccgtggtcgt 
taagcgtcaa 
gctggcagtg 
gccgggtaaa 
caagctggca 



60 
120 
180 
240 
300 
360 
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gcagcgaaac tgccgattaa aaccaccttt gtaactaaga cggtgatgta a 



411 



<210> 226 
<211> 702 
<212> DNA 
<213> E. Coli 

<400> 226 

atgggtcaga aagtacatcc taatggtatt cgcctgggta ttgtaaaacc atggaactct 60 

acctggtttg cgaacaccaa agaattcgct gacaacctgg acagcgattt taaagtacgt 120 

cagtacctga ctaaggaact ggctaaagcg tccgtatctc gtatcgttat cgagcgtccg 180 

gctaagagca tccgtgtaac cattcacact gctcgcccgg gtatcgttat cggtaaaaaa 24 0 

ggtgaagacg tagaaaaact gcgtaaggtc gtagcggaca tcgctggcgt tcctgcacag 300 

atcaacatcg ccgaagttcg taagcctgaa ctggacgcaa aactggttgc tgacagcatc 360 

acttctcagc tggaacgtcg cgttatgttc cgtcgtgcta tgaagcgtgc tgtacagaac 420 

gcaatgcgtc tgggcgctaa aggtattaaa gttgaagtta gcggccgtct gggcggcgcg 4 80 

gaaatcgcac gtaccgaatg gtaccgcgaa ggtcgcgtac cgctgcacac tctgcgtgct 540 

gacatcgact acaacacctc tgaagcgcac accacttacg gtgtaatcgg cgttaaagtg 600 

tggatcttca aaggcgagat cctgggtggt atggctgctg ttgaacaacc ggaaaaaccg 660 

gctgctcagc ctaaaaagca gcagcgtaaa ggccgtaaat aa 7 02 

<210> 227 
<211> 333 
<212> DNA 
<213> E. Coli 

<400> 227 

atggaaacta tcgctaaaca tcgccatgct cgttcttctg ctcagaaggt tcgccttgtt 60 

gctgacctga ttcgcggtaa gaaagtgtcg caggctctgg atattttgac ctacaccaac 120 

aagaaagcgg ctgtactggt caagaaagtt ctggaatctg ccattgctaa cgctgaacac 180 

aacgatggcg ctgacattga cgatctgaaa gttacgaaaa ttttcgtaga cgaaggcccg 240 

agcatgaagc gcattatgcc gcgtgcaaaa ggtcgtgcag atcgcatcct gaagcgcacc 300 

agccacatca ctgtggttgt gtccgatcgc tga 333 

<210> 228 
<211> 279 
<212> DNA 
<213> E. Coli 

<400> 228 

atgccacgtt ctctcaagaa aggtcctttt attgacctgc acttgctgaa gaaggtagag 60 

aaagcggtgg aaagcggaga caagaagccc ctgcgcactt ggtcccgtcg ttcaacgatc 120 

tttcctaaca tgatcggttt gaccatcgct gtccataatg gtcgtcagca cgttccggta 180 

tttgtaaccg acgaaatggt tggtcacaaa ctgggtgaat tcgcaccgac tcgtacttat 240 

cgcggccacg ctgctgataa aaaagcgaag aagaaataa 279 

<210> 229 
<211> 822 
<212> DNA 
<213> E. Coli 

<400> 229 

atggcagttg ttaaatgtaa accgacatct ccgggtcgtc gccacgtagt taaagtggtt 60 

aaccctgagc tgcacaaggg caaacctttt gctccgttgc tggaaaaaaa cagcaaatcc 120 

ggtggtcgta acaacaatgg ccgtatcacc actcgtcata tcggtggtgg ccacaagcag 180 

gcttaccgta ttgttgactt caaacgcaac aaagacggta tcccggcagt tgttgaacgt 240 

cttgagtacg atccgaaccg ttccgcgaac atcgcgctgg ttctgtacaa agacggtgaa 300 
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cgccgttaca tcctggcccc taaaggcctg aaagctggcg accagattca gtctggcgtt 360 

gatgctgcaa tcaaaccagg taacaccctg ccgatgcgca acatcccggt tggttctact 420 

gttcataacg tagaaatgaa accaggtaaa ggcggtcagc tggcacgttc cgctggtact 480 

tacgttcaga tcgttgctcg tgatggtgct tatgtcaccc tgcgtctgcg ttctggtgaa 540 

atgcgtaaag tagaagcaga ctgccgtgca actctgggcg aagttggcaa tgctgagcat 600 

atgctgcgcg ttctgggtaa agcaggtgct gcacgctggc gtggtgttcg tccgaccgtt 660 

cgcggtaccg cgatgaaccc ggtagaccac ccacatggtg gtggtgaagg tcgtaacttt 720 

ggtaagcacc cggtaactcc gtggggcgtt cagaccaaag gtaagaagac ccgcagcaac 780 

aagcgtactg ataaattcat cgtacgtcgc cgtagcaaat aa 822 

<210> 230 
<211> 303 
<212> DNA 
<213> E. Coli 

<400> 230 

atgattcgtg aagaacgtct gctgaaggtg ctgcgtgcac cgcacgtttc tgaaaaagcg 60 

tctactgcga tggaaaaatc caacaccatc gtactcaaag ttgctaaaga cgcgaccaaa 120 

gcagaaatca aagctgctgt gcagaaactg tttgaagtcg aagtcgaagt cgttaacacc 180 

ctggtagtta aagggaaagt taaacgtcac ggacagcgta tcggtcgtcg tagcgactgg 240 

aaaaaagctt acgtcaccct gaaagaaggc cagaatctgg acttcgttgg cggcgctgag 300 

303 

<210> 231 
<211> 630 
<212> DNA 
<213> E. Coli 

<400> 231 

atgattggtt tagtcggtaa aaaagtgggt atgacccgta tcttcacaga agacggcgtt 60 

tctatcccag taaccgtaat cgaagttgaa gcaaaccgcg ttactcaggt taaagacctg 120 

gctaacgatg gctaccgtgc tattcaggtg accaccggtg ctaaaaaagc taaccgtgtg 180 

accaagcctg aagctggcca cttcgctaaa gctggcgtag aagctggccg tggtctgtgg 240 

gaattccgcc tggctgaagg cgaagagttc actgtaggtc agagcattag cgttgaactg 300 

tttgctgacg ttaaaaaagt tgacgtaact ggcacctcta aaggtaaagg tttcgcaggt 360 

accgttaagc gctggaactt ccgtacccag gacgctactc acggtaactc cttgtctcac 420 

cgcgttccgg gttctatcgg tcagaaccag actccgggca aagtgttcaa aggcaagaaa 480 

atggcaggtc agatgggtaa cgaacgtgta accgttcaga gccttgacgt agtacgcgtt 54 0 

gacgctgagc gcaacctgct gctggttaaa ggtgctgtcc cgggtgcaac cggtagcgac 600 

ctgatcgtta aaccagctgt gaaggcgtaa 530 

<210> 232 
<211> 606 
<212> DNA 
<213> E. Coli 

<400> 232 

atggaattag tattgaaaga cgcgcagagc gcgctgactg tttccgaaac taccttcggt 60 

cgtgatttca acgaagcgct ggttcaccag gttgttgttg cttatgcagc tggtgctcgt 120 

cagggtactc gtgctcagaa gactcgtgct gaagtaactg gttccggtaa aaaaccgtgg 18 0 

cgccagaaag gcaccggccg tgcgcgttct ggttctatca agagcccgat ctggcgttct 240 

ggtggcgtga cctttgctgc tcgtccgcag gaccacagtc aaaaagttaa caagaagatg 300 

taccgcggcg cgctgaaaag catcctgtcc gaactggtac gtcaggatcg tctgatcgtt 360 

gtcgagaagt tctctgtaga agcgccgaaa actaagctgc tggcacagaa actgaaagac 420 

atggctctgg aagatgtgct gatcatcacc ggtgagctgg acgaaaacct gttcctggct 480 

gcgcgcaacc tgcacaaggt tgacgtacgc gatgcaactg gtatcgaccc ggttagcctg 540 

atcgccttcg acaaagtcgt aatgactgct gatgctgtta agcaagttga ggagatgctg 600 
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gcatga 



606 



<210> 233 
<211> 312 
<212> DNA 
<213> E. Coli 



aaagcgtttg 


atcatcgtct 


gatcgatcaa 


60 


cgcactggtg 


cgcaggtccg 


tggtccgatc 


120 


gttctgatct 


ccccgcacgt 


caacaaagac 


180 


ttgcgtctgg 


ttgacatcgt 


tgagccaacc 


240 


gatctggctg 


ccggtgtaga 


cgtgcagatc 


300 








312 



<400> 233 
atgcagaacc aaagaatccg tatccgcctg 
gcaaccgcgg aaatcgtcga gactgccaag 
ccgctgccga cacgcaaaga gcgcttcact 
gcgcgcgatc agtacgaaat ccgtactcac 
gagaaaaccg ttgatgctct gatgcgtctg 
agcctgggtt aa 



<210> 234 
<211> 357 
<212> DNA 
<213> E. Coli 

<400> 234 

atggctcgcg taaaacgtgg tgttattgca cgtgcacgtc acaagaaaat tttgaaacaa 

gctaaaggct actacggtgc gcgttctcgc gtataccgcg ttgccttcca ggctgttatc 

aaagctggtc agtatgctta ccgtgaccgt cgtcaacgta agcgtcagtt ccgtcaactg 

tggattgcgc gtatcaacgc agcagcacgt cagaacggta tttcttacag caaattcatc 

aatggcctga aaaaagcctc tgttgaaatc gaccgtaaga tcctggctga tatcgcagta 

ttcgacaaag tagcgttcac cgctctggtt gaaaaagcga aagcagctct ggcataa 

<210> 235 
<211> 198 
<212> DNA 
<213> E. Coli 



60 
120 
180 
240 
300 
357 



<400> 235 

atgccaaaaa ttaagaccgt acgcggtgct gctaagcgct tcaaaaaaac cggtaaaggt 60 

ggttttaagc acaagcacgc taacctgcgt cacattctga ccaaaaaagc gaccaaacgt 120 

aaacgtcacc tgcgtccgaa agccatggtt tccaaaggcg atctgggcct ggtaatcgcg 180 

tgcctgccgt acgcataa I93 

<210> 236 

<211> 543 

<212> DNA 

<213> E. Coli 



<400> 236 
attaaaggcg gaaaacgagt tcaaacggcg 
gcccaggaag ttcgcttaac aggtctggaa 
gaagctctgg agaaagcaga agaagccgga 
gagccgccgg tttgtcgtat aatggattac 
tctaaggaac agaagaaaaa gcaaaaagtt 
ggtacagatg aaggcgacta tcaggtaaaa 
ggtgataaag ccaaaatcac gctgcgtttc 
ggtatggaag tgcttaatcg cgtgaaagac 
ttcccaacga agatcgaagg ccgccagatg 
taa 



cgccctaacc 


gtatcaatgg 


cgaaattcgc 


60 


ggcgagcagc 


ttggtattgt 


gagtctgaga 


120 


gtagacttag 


tcgagatcag 


ccctaacgcc 


180 


ggcaaattcc 


tctatgaaaa 


gagcaagtct 


240 


atccaggtta 


aggaaattaa 


attccgtcct 


300 


ctccgcagcc 


tgattcgctt 


tctcgaagag 


360 


cgcggtcgtg 


agatggcgca 


ccagcaaatc 


420 


gatttgcaag 


aactggcagt 


ggtcgaatcc 


480 


atcatggtgc 


tcgctcctaa 


gaagaaacag 


540 








543 



<210> 237 
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<211> 1929 
<212> DNA 
<213> E. Coli 



<400> 237 

atgcctgtta taactcttcc tgatggcagc caacgccatt acgatcacgc tgtaagcccc 60 

atggatgttg cgctggacat tggtccaggt ctggcgaaag cctgtatcgc agggcgcgtt 120 

aatggcgaac tggttgatgc ttgcgatctg attgaaaacg acgcacaact gtcgatcatt 180 

accgccaaag acgaagaagg tctggagatc attcgtcact cctgtgcgca cctgttaggg 240 

cacgcgatta aacaactttg gccgcatacc aaaatggcaa tcggcccggt tattgacaac 300 

ggtttttatt acgacgttga tcttgaccgc acgttaaccc aggaagatgt cgaagcactc 360 

gagaagcgga tgcatgagct tgctgagaaa aactacgacg tcattaagaa gaaagtcagc 420 

tggcacgaag cgcgtgaaac tttcgccaac cgtggggaga gctacaaagt ctccattctt 480 

gacgaaaaca tcgcccatga tgacaagcca ggtctgtact tccatgaaga atatgtcgat 540 

atgtgccgcg gtccgcacgt accgaacatg cgtttctgcc atcatttcaa actaatgaaa 600 

acggcagggg cttactggcg tggcgacagc aacaacaaaa tgttgcaacg tatttacggt 660 

acggcgtggg cagacaaaaa agcacttaac gcttacctgc agcgcctgga agaagccgcg 720 

aaacgcgacc accgtaaaat cggtaaacag ctcgacctgt accatatgca ggaagaagcg 780 

ccgggtatgg tattctggca caacgacggc tggaccatct tccgtgaact ggaagtgttt 840 

gttcgttcta aactgaaaga gtaccagtat caggaagtta aaggtccgtt catgatggac 900 

cgtgtcctgt gggaaaaaac cggtcactgg gacaactaca aagatgcaat gttcaccaca 960 

tcttctgaga accgtgaata ctgcattaag ccgatgaact gcccgggtca cgtacaaatt 1020 

ttcaaccagg ggctgaagtc ttatcgcgat ctgccgctgc gtatggccga gtttggtagc 1080 

tgccaccgta acgagccgtc aggttcgctg catggcctga tgcgcgtgcg tggatttacc 1140 

caggatgacg cgcatatctt ctgtactgaa gaacaaattc gcgatgaagt taacggatgt 1200 

atccgtttag tctatgatat gtacagcact tttggcttcg agaagatcgt cgtcaaactc 1260 

tccactcgtc ctgaaaaacg tattggcagc gacgaaatgt gggatcgtgc tgaggcggac 1320 

ctggcggttg cgctggaaga aaacaacatc ccgtttgaat atcaactggg tgaaggcgct 1380 

ttctacggtc cgaaaattga atttaccctg tatgactgcc tcgatcgtgc atggcagtgc 1440 

ggtacagtac agctggactt ctctttgccg tctcgtctga gcgcttctta tgtaggcgaa 1500 

gacaatgaac gtaaagtacc ggtaatgatt caccgcgcaa ttctggggtc gatggaacgt 1560 

ttcatcggta tcctgaccga agagttcgct ggtttcttcc cgacctggct tgcgccggtt 1620 

caggttgtta tcatgaatat taccgattca cagtctgaat acgttaacga attgacgcaa 1680 

aaactatcaa atgcgggcat tcgtgttaaa gcagacttga gaaatgagaa gattggcttt 1740 

aaaatccgcg agcacacttt gcgtcgcgtc ccatatatgc tggtctgtgg tgataaagag 1800 

gtggaatcag gcaaagttgc cgttcgcacc cgccgtggta aagacctggg aagcatggac 18 60 

gtaaatgaag tgatcgagaa gctgcaacaa gagattcgca gccgcagtct taaacaattg 1920 

gaggaataa 1929 

<210> 238 
<211> 1353 
<212> DNA 
<213> E. Coli 



<400> 238 

atgactaaac actatgatta catcgccatc ggcggcggca gcggcggtat cgcctccatc 60 

aaccgcgcgg ctatgtacgg ccagaaatgt gcgctgattg aagccaaaga gctgggcggc 120 

acctgcgtaa atgttggctg tgtgccgaaa aaagtgatgt ggcacgcggc gcaaatccgt 180 

gaagcgatcc atatgtacgg cccggattat ggttttgata ccactatcaa taaattcaac 240 

tgggaaacgt tgatcgccag ccgtaccgcc tatatcgacc gtattcatac ttcctatgaa 300 

aacgtgctcg gtaaaaataa cgttgatgta atcaaaggct ttgcccgctt cgttgatgcc 360 

aaaacgctgg aggtaaacgg cgaaaccatc acggccgatc atattctgat cgccacaggc 420 

ggtcgtccga gccacccgga tattccgggc gtggaatacg gtattgattc tgatggcttc 480 

ttcgcccttc ctgctttgcc agagcgcgtg gcggttgttg gcgcgggtta catcgccgtt 540 

gagctggcgg gcgtgattaa cggcctcggc gcgaaaacgc atctgtttgt gcgtaaacat 600 

gcgccgctgc gcagcttcga cccgatgatt tccgaaacgc tggtcgaagt gatgaacgcc 660 

gaaggcccgc agctgcacac caacgccatc ccgaaagcgg tagtgaaaaa taccgatggt 720 
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agcctgacgc tggagctgga agatggtcgc agtgaaacgg tggattgcct gatttgggcg 780 

attggtcgcg agcctgccaa tgacaacatc aacctggaag ccgctggcgt taaaactaac 840 

gaaaaaggct atatcgtcgt cgataaatat caaaacacca atattgaagg tatttacgcg 900 

gtgggcgata acacgggtgc agtggagctg acaccggtgg cagttgcagc gggtcgccgt 960 

ctctctgaac gcctgtttaa taacaagccg gatgagcatc tggattacag caacattccg 1020 

accgtggtct tcagccatcc gccgattggt actgttggtt taacggaacc gcaggcgcgc 1080 

gagcagtatg gcgacgatca ggtgaaagtg tataaatcct ctttcaccgc gatgtatacc 1140 

gccgtcacca ctcaccgcca gccgtgccgc atgaagctgg tgtgcgttgg atcggaagag 1200 

aagattgtcg gtattcacgg cattggcttt ggtatggacg aaatgttgca gggcttcgcg 1260 

gtggcgctga agatgggggc aaccaaaaaa gacttcgaca ataccgtcgc cattcaccca 1320 

acggcggcag aagagttcgt gacaatgcgt taa 1353 

<210> 239 
<211> 2904 
<212> DNA 
<213> E. Coli 

<400> 239 

aaggttaagc ctcacggttc attagtaccg gttagctcaa cgcatcgctg cgcttacaca 60 

cccggcctat caacgtcgtc gtcttcaacg ttccttcagg acccttaaag ggtcagggag 120 

aactcatctc ggggcaagtt tcgtgcttag atgctttcag cacttatctc ttccgcattt 180 

agctaccggg cagtgccatt ggcatgacaa cccgaacacc agtgatgcgt ccactccggt 240 

cctctcgtac taggagcagc ccccctcagt tctccagcgc ccacggcaga tagggaccga 300 

actgtctcac gacgttctaa acccagctcg cgtaccactt taaatggcga acagccatac 360 

ccttgggacc tacttcagcc ccaggatgtg atgagccgac atcgaggtgc caaacaccgc 420 

cgtcgatatg aactcttggg cggtatcagc ctgttatccc cggagtacct tttatccgtt 480 

gagcgatggc ccttccattc agaaccaccg gatcactatg acctgctttc gcacctgctc 540 

gcgccgtcac gctcgcagtc aagctggctt atgccattgc actaacctcc tgatgtccga 600 

ccaggattag ccaaccttcg tgctcctccg ttactcttta ggaggagacc gccccagtca 660 

aactacccac cagacactgt ccgcaacccg gattacgggt caacgttaga acatcaaaca 720 

ttaaagggtg gtatttcaag gtcggctcca tgcagactgg cgtccacact tcaaagcctc 780 

ccacctatcc tacacatcaa ggctcaatgt tcagtgtcaa gctatagtaa aggttcacgg 840 

ggtctttccg tcttgccgcg ggtacactgc atcttcacag cgagttcaat ttcactgagt 900 

ctcgggtgga gacagcctgg ccatcattac gccattcgtg caggtcggaa cttacccgac 960 

aaggaatttc gctaccttag gaccgttata gttacggccg ccgtttaccg gggcttcgat 1020 

caagagcttc gcttgcgcta accccatcaa ttaaccttcc ggcaccgggc aggcgtcaca 1080 

ccgtatacgt ccactttcgt gtttgcacag tgctgtgttt ttaataaaca gttgcagcca 1140 

gctggtatct tcgactgatt tcagctccat ccgcgaggga cctcacctac atatcagcgt 1200 

gccttctccc gaagttacgg caccattttg cctagttcct tcacccgagt tctctcaagc 1260 

gccttggtat tctctacctg accacctgtg tcggtttggg gtacgatttg atgttacctg 1320 

atgcttagag gcttttcctg gaagcagggc atttgttgct tcagcaccgt agtgcctcgt 1380 

catcacgcct cagccttgat tttccggatt tgcctggaaa accagcctac acgcttaaac 1440 

cgggacaacc gtcgcccggc caacatagcc ttctccgtcc ccccttcgca gtaacaccaa 1500 

gtacaggaat attaacctgt ttcccatcga ctacgccttt cggcctcgcc ttaggggtcg 1560 

actcaccctg ccccgattaa cgttggacag gaacccttgg tcttccggcg agcgggcttt 1620 

tcacccgctt tatcgttact tatgtcagca ttcgcacttc tgatacctcc agcatgcctc 1680 

acagcacacc ttcgcaggct tacagaacgc tcccctaccc aacaacgcat aagcgtcgct 1740 

gccgcagctt cggtgcatgg tttagccccg ttacatcttc cgcgcaggcc gactcgacca 1800 

gtgagctatt acgctttctt taaatgatgg ctgcttctaa gccaacatcc tggctgtctg 1860 

ggccttccca catcgtttcc cacttaacca tgactttggg accttagctg gcggtctggg 1920 

ttgtttccct cttcacgacg gacgttagca cccgccgtgt gtctcccgtg ataacattct 1980 

ccggtattcg cagtttgcat cgggttggta agtcgggatg acccccttgc cgaaacagtg 2040 

ctctaccccc ggagatgaat tcacgaggcg ctacctaaat agctttcggg gagaaccagc 2100 

tatctcccgg tttgattggc ctttcacccc cagccacaag tcatccgcta atttttcaac 2160 

attagtcggt tcggtcctcc agttagtgtt acccaacctt caacctgccc atggctagat 2220 

caccgggttt cgggtctata ccctgcaact taacgcccag ttaagactcg gtttcccttc 2280 

ggctccccta ttcggttaac cttgctacag aatataagtc gctgacccat tatacaaaag 2340 
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gtacgcagtc 
ttttcactcc 
gtcagtcagg 
tgtcccgccc 
tgtatcgcgc 
gctccccgtt 
ttagatgttt 
tgtcgaaaca 
ttaccgacgc 
ccgtgtacgc 



acacgcctaa 
cctcgccggg 
agtatttagc 
tactcatcga 
gcctttccag 
cgctcgccgc 
cagttccccc 
cactgggttt 
ttatcgcaga 
ttagtcgctt 



gcgtgctccc 
gttcttttcg 
cttggaggat 
gctcacagca 
acgcttccac 
tactggggga 
ggttcgcctc 
ccccattcgg 
ttagcacgtc 
aacc 



actgcttgta 
cctttccctc 
ggtcccccca 
tgtgcatttt 
taacacacac 
atctcggttg 
attaacctat 
aaatcgccgg 
cttcatcgcc 



cgtacacggt 
acggtactgg 
tattcagaca 
tgtgtacggg 
actgattcag 
atttcttttc 
ggattcagtt 
ttataacggt 
tctgactgcc 



ttcaggttct 
ttcactatcg 
ggataccacg 
gctgtcaccc 
gctctgggct 
ctcggggtac 
aatgatagtg 
tcatatcacc 
agggcatcca 



<210> 240 
<211> 120 
<212> DNA 
<213> E. Coli 

<400> 240 

atgcctggca gttccctact ctcgcatggg gagaccccac actaccatcg gcgctacggc 
gtttcacttc tgagttcggc atggggtcag gtgggaccac cgcgctacgg ccgccaggca 

<210> 241 

<211> 76 

<212> DNA 

<213> E. Coli 

<400> 241 

gtccccttcg tctagaggcc caggacaccg ccctttcacg gcggtaacag gggttcgaat 
cccctagggg acgcca 

<210> 242 

<211> 1549 

<212> DNA 

<213> E. Coli 



2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2904 



60 
120 



60 
76 



<400 
aaattgaaga 
gtcgaacggt 
tgtctgggaa 
aatgtcgcaa 
ggattagctt 
ggatgaccag 
ggaatattgc 
tcgggttgta 
gacgttaccc 
ggtgcaagcg 
agatgtgaaa 
cgtagagggg 
cggtggcgaa 
aaacaggatt 
ccttgaggcg 
aaggttaaaa 
ttcgatgcaa 
gaatgtgcct 
aaatgttggg 
ccgggaactc 
catcatggcc 
gacctcgcga 



> 242 
gtttgatcat 
aacaggaagc 
gctgcctgat 
gaccaaagag 
gttggtgggg 
ccacactgga 
acaatgggcg 
aagtactttc 
gcagaagaag 
ttaatcggaa 
tccccgggct 
ggtagaattc 
ggcggccccc 
agataccctg 
tggcttccgg 
ctcaaatgaa 
cgcgaagaac 
tcgggaaccg 
ttaagtcccg 
aaaggagact 
cttacgacca 
gagcaagcgg 



ggctcagatt 
agcttgctgc 
ggagggggat 
ggggaccttc 
taacggctca 
actgagacac 
caagcctgat 
agcggggagg 
caccggctaa 
ttactgggcg 
caacctggga 
caggtgtagc 
tggacgaaga 
gtagtccacg 
agctaacgcg 
ttgacggggg 
cttacctggt 
tgagacaggt 
caacgagcgc 
gccagtgata 
gggctacaca 
acctcataaa 



gaacgctggc 
ttcgctgacg 
aactactgga 
gggcctcttg 
ccaaggcgac 
ggtccagact 
gcagccatgc 
aagggagtaa 
ctccgtgcca 
taaagcgcac 
actgcatctg 
ggtgaaatgc 
ctgacgctca 
ccgtaaacga 
ttaagtcgac 
cccgcacaag 
cttgacatcc 
gctgcatggc 
aacccttatc 
aactggagga 
cgtgctacaa 
gtgcgtcgta 



ggcaggccta 
agtggcggac 
aacggtagct 
ccatcggatg 
gatccctagc 
cctacgggag 
cgcgtgtatg 
agttaatacc 
gcagccgcgg 
gcaggcgggt 
atactggcaa 
gtagagatct 
ggtgcgaaag 
tgtcgacttg 
cgcctgggga 
cggtggagca 
acggaagttt 
tgtcgtcagc 
ctttgttgcc 
SLggtggggat 
tggcgcatac 
gtccggattg 



acacatgcaa 
gggtgagtaa 
aataccgcat 
tgcccagatg 
tggtctgaga 
gcagcagtgg 
aagaaggcct 
tttgctcatt 
taatacggag 
tggttaagtc 
gcttgagtct 
ggaggaatac 
cgtggggagc 
gaggttgtgc 
gtacggccgc 
tgtggtttaa 
tcagagatga 
tcgtgttgtg 
agcggtccgg 
gacgtcaagt 
aaagagaagc 
gagtctgcaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
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ctcgactcca tgaagtcgga atcgctagta atcgtggatc agaatgccac ggtgaatacg 1380 

ttcccgggcc ttgtacacac cgcccgtcac accatgggag tgggttgcaa aagaagtagg 14 40 

tagcttaacc ttcgggaggg cgcttaccac tttgtgattc atgactgggg tgaagtcgta 1500 

acaaggtaac cgtaggggaa cctgcggttg gatcacctcc ttaccttaa 1549 



<210> 243 

<211> 221 
<212> PRT 
<213> E. Coll 



<400> 
Met Asn Val 
1 

Ser Leu Phe 

Leu Pro Val 
35 

Phe Pro Phe 
50 

Ala Pro Leu 
65 

Leu lie Ser 

Gly Phe Gly 

Thr Ala Ser 
115 

Val Phe Asn 
130 

Ala Ser Thr 
145 

lie Ala Phe 

Glu He Ala 

Phe Phe Leu 
195 

Leu Ala Asp 
210 



243 

Phe Ser 
5 

His Leu 
20 

Ser He 

He Phe 

Ala Arg 

Tyr Val 
85 

Ala Leu 
100 

Phe Met 

Arg Leu 

Leu Phe 

Trp Arg 
165 
Leu Val 
180 

Pro Met 
Lys Ser 



Gin Thr Gin 

Leu Val He 

Leu Gly Phe 
40 

Leu Ala Thr 
55 

Arg He He 
70 

He Ser Ser 

Ala His Phe 

Ala Tyr Ala 
120 

Arg Gin Ser 
135 

Gly Asn Val 
150 

Ser Pro Asp 

Asp Tyr Cys 

Tyr Gly Val 
200 

Glu He Asn 
215 



Arg 

Thr 

25 

His 

Asp 

Phe 

Leu 

Asn 
105 
Leu 

Arg 

Ser 

Ala 

Phe 
185 
Leu 

Ala 



Tyr 

10 

Ser 

Thr 

Leu 

Ala 

Phe 

90 

Leu 

Gly 

Arg 

Asp 

Phe 
170 
Lys 

Leu 

Leu 



Lys Ala Leu Phe 
Ser Asn 
Thr Trp 



Thr Val 
60 

Val Met 
75 

Tyr Met 

Phe Val 

Gin He 

Trp Trp 
140 
Thr Leu 
155 

Met Ala 

Val Leu 

Asn Met 

Gin Ala 
220 



Tyr Leu 

30 
Gly Ala 
45 

Arg He 



He Pro 

Gly Ser 

Ala Arg 
110 
Leu Asp 
125 

Leu Ala 

Ala Phe 

Glu His 

He Ser 
190 
Leu Leu 
205 
Ser 



Trp Leu 
15 

Val Gin 

Phe Ser 

Phe Gly 

Ala Leu 
80 

Trp Gin 
95 

He Ala 

Val His 

Pro Thr 

Phe Phe 
160 
Trp Met 
175 

He Val 
Lys Arg 



<210> 244 
<211> 203 
<212> PRT 
<213> E. Coli 



<400> 244 
Met He Arg Trp Met Asn Glu Pro 

1 5 
Lys Ser Met Arg Asn Leu Val Lys 
20 

Met Gly Leu Ala Ala Cys Asp Asp 

35 40 
Gly Ser Val Ala Glu Ser Asn Ala 

50 55 
Asp Gly Lys Leu Ser Phe Ser Leu 



Leu Trp Pro Phe He Glu Arg Lys 

10 15 
Tyr Val Gly He Gly Leu Leu Val 
25 30 
Lys Asp Thr Asn Ala Thr Ala Gin 
45 

Thr Gly Asn Pro Val Asn Leu Leu 
60 

Pro Ala Asp Met Thr Asp Gin Ser 
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65 










70 










75 










80 


Gly 


Lys 


Leu 


Gly 


Thr 


Gin 


Ala 


Asn 


Asn 


Met 


His 


Val 


Trp 


Ser 


Asp Ala 










85 










90 










95 




Thr 


Gly 


Gin 


Lys 


Ala 


Val 


He 


Val 


He 


Met 


Gly Asp Asp 


Pro 


Lys 


Glu 








100 










105 










110 






Asp 


Leu 


Ala 


Val 


Leu 


Ala 


Lys 


Arg 


Leu 


Glu Asp 


Gin 


'a-Ln 


Arg 


Ser 


Arg 






115 










120 










125 








Asp 


Pro 


Gin 


Leu 


Gin 


Val 


Val 


Thr 


Asn 


Lys 


Ala 


He 


U J_U 


Leu 


Lys 


biy 




130 










135 










140 








His 


Lys 


Met 


Gin 


Gin 


Leu 


Asp 


Ser 


He 


i±e 


Ser 


Ala 


Lys 


Gly Gin 


Thr 


145 










150 










155 










160 


Ala 


Tvr 


Ser 


Ser 


Val 


He 


Leu 


Gly 




Val 


Gly Asn 


bin 


Leu 


Leu 


Thr 










165 




















I/O 




Met 


Gin 


He 


Thr 


Leu 


Pro 


Ala 


Asp 






Gin 


Lys 


Ala 


bxn 


Thr 


inr 








180 










185 










190 






Ala 


Glu 


Asn 


He 


He 


Asn 


Thr 


Leu 


Val 


He 


Gin 
















195 










200 




















<210> 


245 




























<211> 


324 




























<212> 


PRT 




























<213> 


E. Coli 


























<400> 


245 


























Met 


Ala 


Asn 


Met 


Phe 


Ala 


Leu 


He 


Leu 


V a± 


He 


Ala 


Thr 


Leu 


Val 


Thr 


1 








5 










1 n 
1 U 










15 




Glv 


He 


Leu 


Trp 


Cys 


Val 


Asp 


Lys 


Phe 


irlie 


Phe 


Ala 


D V 

irr O 


Lys 


Arg 


Arg 








20 










25 










Pi 






Glu 


Arg 


Gin 


Ala 


Ala 


Ala 


Gin 


Ala 


Ala 


r-iJ_ d 


Gly Asp 


oer 


Leu 


Asp 


Lys 






35 










40 


















Ala 


Thr 


Leu 


Lys 


Lys 


Val 


Ala 


Pro 


Lys 




Gly 


Trp 


Leu 


blU 


Thr 


Gly 




50 










55 










60 








Ala 


Ser 


Val 


Phe 


Pro 


Val 


Leu 


Ala 


He 


Val 


Leu 


He 


V a± 


Arg 


Ser 


Phe 


65 










70 










75 








o n 
0 U 


He 


Tyr 


Glu 


Pro 


Phe 


Gin 


He 


Pro 


Ser 


Gly 


Ser 


Met 




Pro 


Thr 


Leu 










85 










90 










yb 




Leu 


He 


Gly 


Asp 


Phe 


He 


Leu 


Val 


Glu 


Lys 


Phe 


Ala 


T\7T~ 


bxy 


lie 


Lys 








100 










105 










1 1 U 




Asp 


Pro 


He 


Tyr 


Gin 


Lys 


Thr 


Leu 


He 


Glu 


Thr 


Gly 


n_L o 


Pro 


Lys 


Arg 






115 










120 










1 9 R 
1 Z 0 






Gly 


Asp 


He 


Val 


Val 


Phe 


Lys 


Tvr 


Pro 


Glu 


Asp 


Pro 


Lys 


Leu 


Asp 


Tyr 




130 










135 










140 






He 


Lys 


Arg 


Ala 


Val 


Gly 


Leu 


Pro 


Glv 


Asp 


Lys 


Val 


Thr 


Tyr 


Asp 


IT J_0 


145 










150 










155 








1 oU 


Val 


Ser 


Lys 


Glu 


Leu 


Thr 


He 


Gin 


Pro 


Gly Cys 


Ser 


Ser 


Gly 


Gin 


/II a 










165 










170 










175 




Cys 


Glu 


Asn 


Ala 


Leu 


Pro 


Val 


Thr 


Tvr 


Ser 


Asn 


Val 


Glu 


Pro 


Ser 


Asp 








180 










185 










190 




Phe 


Val 


Gin 


Thr 


Phe 


Ser 


Arg 


Arg 


Asn 


Gly 


Gly 


Glu 


Ala 


Thr 


Ser 


Gly 






195 










200 










205 






Phe 


Phe 


Glu 


Val 


Pro 


Lys 


Asn 


Glu 


Thr 


Lys 


Glu 


Asn Gly 


He 


Arg 


Leu 




210 










215 










220 








Ser 


Glu 


Arg 


Lys 


Glu 


Thr 


Leu 


Gly 


Asp 


Val 


Thr 


His 


Arg 


He 


Leu 


Thr 


225 










230 










235 








240 


Val 


Pro 


He 


Ala 


Gin 


Asp 


Gin 


Val 


Gly 


Met 


Tyr 


Tyr 


Gin 


Gin 


Pro 


Gly 



245 250 255 
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Gin Gin Leu Ala Thr Trp lie Val Pro Pro Gly Gin Tyr Phe Met Met 

260 265 270 

Gly Asp Asn Arg Asp Asn Ser Ala Asp Ser Arg Tyr Trp Gly Phe Val 

275 280 285 

Pro Glu Ala Asn Leu Val Gly Arg Ala Thr Ala lie Trp Met Ser Phe 

290 295 300 

Asp Lys Gin Glu Gly Glu Trp Pro Thr Gly Leu Arg Leu Ser Arg He 
305 310 315 320 

Gly Gly He His 



<210> 
<211> 
<212> 
<213> 



246 
586 
PRT 

E. Coll 



<400> 
Met Thr He 
1 

Ser Tyr Gin 

Pro Leu Phe 
35 

Leu Leu His 
50 

Glu Glu Ser 
65 

Gin Ser Asp 

His Ser lie 

Thr Leu Thr 
115 

Phe Gly Cys 
130 

Leu Val His 
145 

Leu Leu Ala 

Arg Glu Arg 

Val Ser Val 
195 

Glu Arg Glu 

210 
Glu Gin Ala 
225 

Ala Glu Ser 

His Asn His 

Arg Glu Ala 
275 

Pro Gin Trp 
290 



246 

Thr Lys Leu 
5 

Glu He Phe 
20 

Ser Asp Thr 

Thr Arg Ala 

Glu Tyr Leu 
70 

Ala Gly Gin 
85 

Arg Leu Arg 
100 

Gin Val Val 

Leu Arg Gin 

Gin Ala Asn 
150 

Gin Pro Leu 

165 
Phe Asp Trp 
180 

Pro Ser Met 

Ser Leu Ala 

He Tyr Ser 
230 

Val Thr Gin 

245 
Leu Pro Ala 
260 

Ala Arg Tyr 
He Leu Arg 



Ala Trp Arg 

Ala Gin Pro 
25 

Gin Pro Arg 
40 

Ser Ser Ser 
55 

Asn Leu He 

Leu Val Gly 

His Ala Val 
105 

Ala Ala Asp 
120 

Phe Asn Gly 
135 

Gly Gly He 

Leu Trp Met 

Val Ala Phe 
185 

Pro Leu Lys 

200 
Asp Phe Gin 
215 

Glu Phe Glu 

Trp Cys Arg 

Pro Gly Ala 
265 

Thr Gly Glu 

280 
Gin Cys Lys 
295 



Asp Leu Val 
10 

His Leu He 

Leu Gin Phe 

Phe Met Leu 
60 

Ala Asn Ala 
75 

Gly His Tyr 
90 

Ser Ala Asp 

Trp Val Glu 

Asp He Thr 
140 

Leu He He 
155 

Arg Leu Lys 
170 

Asp Glu Ser 

Leu Lys Val 

Glu Met Glu 
220 

Asp Thr Leu 

235 
Trp Val Thr 
250 

Asp Ala Trp 

Gin Glu Thr 

Glu Val Ala 
300 



Pro Asp 

Asp Glu 
30 

Ala Leu 
45 

Ala Lys 

Ala Arg 

Glu Val 

Asp Asn 
110 
Ala Glu 
125 

Leu Gin 

Ser Leu 

Asn He 

Arg Pro 
190 
He Leu 
205 

Pro Glu 

Gin He 

Phe Thr 

Pro He 
270 
Leu Pro 
285 

Ser Leu 



Thr Asp 
15 

Asn Asp 

Glu Gin 

Ala Pro 

Thr Leu 
80 

Ser Gly 
95 

Phe Ala 

Gin Leu 

Pro Gly 

Arg Thr 
160 
Val Asn 
175 

Leu Pro 

Val Gly 

Leu Ser 

Val Asp 
240 
Ala Arg 
255 

Leu He 
Leu Ser 
Cys Asp 
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Gly Asp 


Thr 


Phe 


Ser 


Gly 


Glu 


Gin 


Leu 


Asn 


Leu 


Met 


Leu 


Gin 


Gin 


Arg 


305 










310 










315 










320 


Glu 


Trp 


Arg 


Glu 


Gly 


Phe 


Leu 


Ala 


Glu 


Arg Met 


Gin 


Asp 


Glu 


He 


Leu 










325 










330 










335 




Gin 


Glu 


Gin 


He 


Leu 


He 


Glu 


Thr 


Glu 


Gly 


Glu 


Arg 


He 


Gly Gin 


He 








340 










345 










350 






Asn 


Ala 


Leu 


Ser 


Val 


He 


Glu 


Phe 


Pro 


Gly 


His 


Pro 


Arg Ala 


Phe 


Gly 






355 










360 










\j -J 






Glu 


Pro 


Ser 


Arg 


He 


Ser 


Cys 

J: 


Val 


Val 


His 


He 


Gly 




Gly Glu 


Phe 




370 










375 










380 










Thr 


Asp 


He 


Glu 


Arg 


Lvs 


Ala 


Glu 


Leu 


Gly Gly Asn 




nlS 


Ala 




385 










390 










"5 Q R 










'it \J U 


Gly Met 


Met 


He 


Met 


Gin 


Ala 


Phe 


Leu 


iYieti 


Ser 


blU 


Leu 


bin 


Leu 


Glu 










405 




















/lie 

4 lO 








He 


Pro 


Phe 


Ser 


Ala 


Ser 


Leu 


Thr 


rne 


blU 


bin 


Ser 


Tyr 


O C -L 








420 










425 










4 oU 




Glu 


Val 


Asp 


Gly 

J: 


Asp 


Ser 


Ala 


Ser 


Met 


Ala 


Glu 


Leu 


Cys 


Ala 


Leu 


T 1 p 

X _L C 






435 










440 










A A ^ 
4 4 0 








Ser 


Ala 


Leu 


Ala 


Asp 


Val 


Pro 


Val 


Asn 


Gin 


Ser 


He 


Ala 


He 


Thr 


o j_y 




450 










455 










460 








Ser 


Val 


Asp 


Gin 


Phe 


Glv 


Arg 


Ala 


Gin 


Pro 


Val 


Gly 


bly 


Leu 


Asn 


Glu 


4 bo 










470 










475 










4 O U 


Lys 


He 


Glu 


Gly 


Phe 


Phe 


Ala 


He 


Cys 


Gin 


Gin 


Arg 


Glu 


Leu 


Thr 












485 










490 
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Glu 


Glu 


Ala 


X _L tr 




A vrr 

y 


Leu 


riei. 


A 1 o 


Thr 


Met 


Lys 




130 








J- -J u 










1 4 u 








Glu 


Ala Ser 


Ala 


Gly 


Lys 


Leu 


Val 


Arg 


Thr 


Leu 


Ala 


Ala 


Val 


Arg Asp 


145 








150 










155 










160 


Ala 


Lys Glu 


Ala 


Ala 






























165 
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252 




























<211> 


121 
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252 
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Met Ser He Thr Lys Asp Gin He He Glu Ala Val Ala Ala Met Ser 

15 10 15 

Val Met Asp Val Val Glu Leu He Ser Ala Met Glu Glu Lys Phe Gly 

20 25 30 

Val Ser Ala Ala Ala Ala Val Ala Val Ala Ala Gly Pro Val Glu Ala 

35 40 45 

Ala Glu Glu Lys Thr Glu Phe Asp Val He Leu Lys Ala Ala Gly Ala 

50 55 60 

Asn Lys Val Ala Val He Lys Ala Val Arg Gly Ala Thr Gly Leu Gly 
65 70 75 80 

Leu Lys Glu Ala Lys Asp Leu Val Glu Ser Ala Pro Ala Ala Leu Lys 

85 90 95 

Glu Gly Val Ser Lys Asp Asp Ala Glu Ala Leu Lys Lys Ala Leu Glu 

100 105 110 

Glu Ala Gly Ala Glu Val Glu Val Lys 
115 120 



<210> 253 
<211> 714 
<212> PRT 
<213> E. Coli 



<400> 
Met Ser Arg 
1 

Thr Ser Val 

Arg Leu Ser 
35 

Ala Pro Asp 
50 

Thr Ala Ala 
65 

Ser Asn Gin 

Ala Asn Thr 

Thr Arg Lys 
115 

Thr Leu Asn 
130 

Pro Glu Gin 
145 

Gly Ala Lys 

Ala Pro Val 

Phe Asp Asp 
195 

Leu Gin Glu 

210 
Phe Asp Leu 
225 

Ala Thr He 
Val Thr Phe 



253 

He He 
5 

Ser Leu 
20 

Val Phe 

Gin Thr 

Glu Pro 

Lys Asp 
85 

Lys Asp 
100 

His Gin 

Ala Glu 

Leu Lys 

Asn Thr 
165 
Asp Glu 
180 

Ser Ser 

Ser Ser 

He Ala 

He Asn 
245 
Cys Ala 



Met Leu He 

Gly Val He 

Lys Pro He 
40 

Thr Thr He 
55 

Leu Lys Met 
70 

Val Leu Met 
Ala Glu Val 

Phe Ala Gin 

120 

He Val Phe 
135 

Glu Arg He 
150 

Asn He Thr 

Gin Gly Arg 

Lys Ala Lys 
200 

Pro Leu Pro 

215 
Thr Arg Ala 
230 

Glu Gly Asp 
Arg Ser He 



Pro Thr 
10 

Arg Ala 
25 

Ala Gin 

Val Arg 

Ser Tyr 

Glu Glu 
90 

Val Leu 
105 

Ser Leu 

Val Met 

Glu Leu 

Gly Val 
170 
Thr Arg 
185 

Val Asn 

Val Leu 

He Asp 

He Asn 
250 
Pro His 



Gly Thr Ser 

Met Glu Arg 

Pro Arg Thr 
45 

Ala Asn Ser 
60 

Val Glu Gly 
75 

He Val Ala 

Val Glu Gly 

Asn Tyr Glu 
125 

Ser Gin Gly 

140 
Thr Arg Asn 
155 

He Val Asn 

Pro Asp Leu 

Asn Val Asp 
205 

Gly Ala Val 

220 
Met Ala Arg 
235 

Thr Arg Arg 
Met Leu Glu 



Val Gly Leu 
15 

Lys Gly Val 
30 

Gly Gly Asp 

Ser Thr Thr 

Leu Leu Ser 
80 

Asn Tyr His 
95 

Leu Val Pro 
110 

He Ala Lys 

Thr Asp Thr 

Ser Phe Gly 
160 

Lys Leu Asn 

175 
Ser Glu He 
190 

Pro Ala Lys 

Pro Trp Ser 

His Leu Asn 
240 

Val Lys Ser 

255 
His Phe Arg 



-97- 



260 

Ala Gly Ser Leu 
275 

Ala Ala Cys Leu 
290 

Leu Thr Gly Gly 
305 

Arg Ala Phe Ala 

Trp Gin Thr Ser 
340 

Asp Asp His Glu 
355 

lie Asn Ala Asp 
370 

Arg Arg Leu Ser 
385 

Arg Lys Ala Gly 

Thr Val Lys Ala 
420 

Val Leu Leu Gly 
435 

Gly Val Glu Leu 
450 

Arg Glu Ser Tyr 
465 

Met Thr Glu Thr 

Gly Thr Leu Met 
500 

Ala Val His Thr 
515 

Lys Thr Ala Pro 
530 

Leu Pro Glu Gin 
545 

Pro Thr Ala Glu 

Ala Ala Ala Phe 
580 

Thr Gly Thr Ser 
595 

Thr Arg Leu Ala 
610 

Leu Gin Tyr Asp 
625 

Pro Asn Ser Pro 

Leu Asn Thr Gly 
660 

Leu lie Ser lie 
675 

Asp Leu Ser Arg 
690 

Leu Thr Ala lie 
705 



Leu Val Thr Ser 
280 

Ala Ala Met Asn 
295 

Tyr Glu Met Asp 
310 

Thr Gly Leu Pro 

325 

Leu Ser Leu Gin 

Arg lie Glu Lys 
360 

Trp He Glu Ser 
375 

Pro Pro Ala Phe 
390 

Lys Arg He Val 
405 

Ala Ala He Cys 

Asn Pro Ala Glu 
440 

Gly Ala Gly He 
455 

Val Gly Arg Leu 
470 

Val Ala Arg Glu 
485 

Leu Glu Gin Asp 

Thr Ala Asn Thr 
520 

Gly Ser Ser Leu 
535 

Val Tyr Val Tyr 
550 

Gin Leu Ala Glu 
565 

Gly He Glu Pro 

Gly Ala Gly Ser 
600 

Gin Glu Lys Arg 
615 

Ala Ala Val Met 
630 

Val Ala Gly Arg 
645 

Asn Thr Thr Tyr 

Gly Pro Met Leu 
680 

Gly Ala Leu Val 
695 

Gin Ser Ala Gin 
710 



265 

Ala Asp Arg Pro 

Gly Val Glu He 
300 

Ala Arg He Ser 
315 

Val Phe Met Val 
330 

Ser Phe Asn Leu 
345 

Val Gin Glu Tyr 

Leu Thr Ala Thr 
380 

Arg Tyr Gin Leu 
395 

Leu Pro Glu Gly 
410 

Ala Glu Arg Gly 
425 

He Asn Arg Val 

Glu He Val Asp 
460 

Val Glu Leu Arg 
475 

Gin Leu Glu Asp 
490 

Glu Val Asp Gly 
505 

He Arg Pro Pro 

Val Ser Ser Val 
540 

Gly Asp Cys Ala 
555 

He Ala He Gin 
570 

Arg Val Ala Met 
585 

Asp Val Glu Lys 

Pro Asp Leu Met 
620 

Ala Asp Val Ala 
635 

Ala Thr Val Phe 
650 

Lys Ala Val Gin 
665 

Gin Gly Met Arg 

Asp Asp He Val 
700 

Gin Gin 



270 

Asp Val Leu Val 
285 

Gly Ala Leu Leu 

Lys Leu Cys Glu 
320 

Asn Thr Asn Thr 
335 

Glu Val Pro Val 
350 

Val Ala Asn Tyr 
365 

Ser Glu Arg Ser 

Thr Glu Leu Ala 
400 

Asp Glu Pro Arg 
415 

He Ala Thr Cys 
430 

Ala Ala Ser Gin 
445 

Pro Glu Val Val 

Lys Asn Lys Gly 
480 

Asn Val Val Leu 
495 

Leu Val Ser Gly 
510 

Leu Gin Leu He 
525 

Phe Phe Met Leu 

He Asn Pro Asp 
560 

Ser Ala Asp Ser 
575 

Leu Ser Tyr Ser 
590 

Val Arg Glu Ala 
605 

He Asp Gly Pro 

Lys Ser Lys Ala 
640 

He Phe Pro Asp 
655 

Arg Ser Ala Asp 
670 

Lys Pro Val Asn 
685 

Tyr Thr He Ala 



<210> 
<211> 
<212> 
<213> 



254 
588 
PRT 

E. Coli 



<400> 
Met Asn Asn 
1 

Tyr Gin Glu 

lie lie Asp 
35 

Ser Gly Pro 
50 

Glu Tyr Thr 
65 

Thr Ala Val 

Met Met Thr 

Thr Thr Val 
115 

Ala Gly Phe 

130 
Gin Tyr Leu 
145 

Val Asn Gly 

His Pro Gin 

He Ser Gly 
195 

Leu Thr Leu 
210 

Ala Tyr He 
225 

Glu Glu Gly 

Glu Gly Ser 

Glu Phe Asn 
275 

Trp Glu He 

290 
He Glu Gin 
305 

Trp Ser Thr 

Pro Gly Lys 

Thr Val Gin 
355 

Leu Gin Ala 

370 
Gly Asn Thr 



254 

Ser He 
5 

Leu Leu 
20 

Asn Val 

He Val 

Asp Ala 

Pro Gly 

85 
Pro Val 
100 

He Cys 

Ala Trp 

Gin Val 

Ala Ser 
165 
Val Thr 
180 

Gin Asn 

Asp Gly 

Thr Ala 

Arg Arg 
245 
Ala Ala 
260 

Ser Pro 

Ala His 

His Asn 

Ala Arg 
325 
Gin Ala 
340 

Gin Val 
Glu Glu 
He Ala 



Asn His Lys 

Ala Val Ser 

Ser He Leu 
40 

He Lys Gly 
55 

Pro Ala Leu 
70 

Phe He Asp 

Thr Phe Glu 

Asp Pro His 
120 

Phe Ala Arg 

135 
Ser Ser Cys 
150 

Phe Thr Leu 

Gly Leu Ala 

Ala Leu Leu 
200 

His Cys Pro 

215 
Gly He Glu 
230 

Lys Leu Gin 

Arg Asn Leu 

Gin Cys Met 
280 

Glu Gly His 

295 
Val Pro Leu 
310 

His Phe Gly 

Asp He Val 

Leu Val Lys 
360 

Ser Ala Arg 
375 

Arg Gin Pro 



Phe His 
10 

Arg Gly 
25 

Asp Leu 

Arg Tyr 

Gin Arg 

Ala His 
90 

Thr Ala 
105 

Glu He 

Cys Ala 

Val Pro 

Glu Gin 
170 
Glu Met 
185 

Asp Lys 

Gly Leu 

Asn Cys 

Leu Gly 
250 
Asn Ala 
265 

Leu Cys 

He Asp 

His Val 

Leu Asn 
330 
Leu Leu 
345 

Gly Glu 
Leu Ala 
Val Ser 



His He Ser 

Asp Ala Val 

He Asn Gly 
45 

He Ala Gly 
60 

He Asp Ala 
75 

Leu His He 

Thr Leu Pro 

Val Asn Val 
125 

Glu Gin Ala 
140 

Ala Leu Glu 
155 

Met Leu Ala 

Met Asp Tyr 

Leu Asp Ala 
205 

Gly Gly Lys 

220 
His Glu Ser 
235 

Met Ser Leu 

Leu Ala Pro 

Thr Asp Asp 
285 

Ala Leu He 
300 

Ala Tyr Arg 
315 

His Leu Gly 

Ser Asp Ala 

Pro He Asp 
365 

Gin Ser Ala 

380 
Ala Ser Asp 



Arg Ala Glu 
15 

Ala Asp Tyr 
30 

Gly Glu He 

Val Gly Ala 

Arg Gly Ala 
80 

Glu Ser Ser 
95 

Arg Gly Leu 

110 

Met Gly Glu 

Arg Gin Asn 

Gly Cys Asp 
160 

Trp Arg Asp 

175 
Pro Gly Val 
190 

Phe Arg His 

Glu Leu Asn 

Tyr Gin Leu 
240 

Met He Arg 

255 
Leu He Asn 
270 

Arg Asn Pro 

Arg Arg Leu 

Val Ala Ser 
320 

Leu Leu Ala 

335 
Arg Lys Val 
350 

Ala Gin Thr 
Pro Pro Tyr 
Phe Ala Leu 
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385 390 395 400 



Gin 


Phe 


Thr 


Pro 


Gly 


Lys 


Arg 


Tvr 


Arg 


Val 


He 


Asp 


Val 


He 


His 


Asn 










405 










410 










415 




Glu 


Leu 


He 


Thr 


His 


Ser 


His 


Ser 


Ser 


Val 


Tvr 


Ser 


Glu 


Asn 




Phe 








420 










425 










430 






Asp 


Arg 


Asp 


Asp 


Val 


Ser 


Phe 


He 


Ala 


Val 


Leu 


Glu 


Arg 


Tvr 


Gly 


Gin 






435 










440 










445 








Arg 


Leu 


Ala 


Pro 


Ala 


Cys 


Gly 


Leu 


Leu 


Glv 


Gly 


Phe 


Gly 


Leu 


Asn 


Glu 




450 










455 










460 










Gly 


Ala 


Leu 


Ala 


Ala 


Thr 


Val 


Ser 


His 


Asp 


Ser 


His 


Asn 


He 


Val 


Val 


465 










470 










475 










480 


lie 


Gly 


Arg 


Ser 


Ala 


Glu 


Glu 


Met 


Ala 


Leu 


Ala 


Val 


Asn 


Gin 


Val 


T 1 

-L -L O 










485 










490 










4 95 




Gin 


Asp 


Glv 


Glv 


Glv 


Leu 


Cys 


Val 


Val 


Arg 


Asn 


Gly 


Gin 


Val 


r;i n 










500 










505 










510 






His 


Leu 


Pro 


Leu 


Pro 


He 


Ala 


Gly 


Leu 


Met 


Ser 






1 i i-L 


Al 


(^1 n 






515 










520 










S 9 S 
O ^ O 








Ser 


Leu 


Ala 


Glu 


Gin 


He 


Asp 


Ala 


Leu 


Lys 


Ala 


Ala 


Ala 


Arg 


Glu 


Cys 




530 










535 










S4 n 

^ 'i U 










Gly 


Pro 


Leu 


Pro 


Asp 


Glu 


Pro 


Phe 


He 


Gin 


Met 


Ala 


Phe 


Leu 


Ser 


Leu 


545 




















ODD 










n 

0 bU 


Pro 


Val 


He 


n J- <J 




Leu 


Xjys 


Leu 


1 njc 


Q i=i V 

oer 


Lj±n 


biy 


Leu 


t^ne 


Asp 


bly 










^ U >J 










R7 n 
o / u 










e; "7 c: 
0/0 




Glu 


Lys 


Phe 


Ala 


Phe 


Thr 


Thr 


Leu 


Glu 


Val 


Thr 


Glu 
















580 










585 


















<210> 


255 




























<211> 


408 




























<212> 


PRT 




























<213> 


E. Cell 


























<400> 


255 


























Met 


Ala 


Tyr 


Cys 


Asn 


Pro 


Gly 


Leu 


Glu 


Ser 


Arg 


Pro 




Lys 


Arg Asn 


1 








5 










10 










15 




Ala 


Leu 


Arg 


Arg 


His 


Val 


Val 


Thr 


Gly 


He 


Gly Met 


Lys 


-L X t: 


Val 


He 








20 










25 










30 






Ala 


Pro 


Asp 


Ser 


Tyr 


Lys 


Glu 


Ser 


Leu 


Ser 


Ala 


Ser 


Glu 


V ciJ. 


Ala 


Gin 






35 










40 










45 








Ala 


He 


Glu 


Lys 


Gly 


Phe 


Arg 


Glu 


He 


Phe 


Pro 


Asp 


Al 


o J_li 


Tyr 


Val 




50 










55 










60 








Ser 


Val 


Pro 


Val 


Ala 


Asp 


Gly 


Gly 


Glu 


Gly 


Thr 


Val 


O-L LI 


Al 3 


Met 


He 


65 










70 










75 










80 


Ala 


Ala 


Thr 


Gin 


Gly 


Ala 


Glu 


Arg 


His 


Ala 


Trp 


Val 


1 iii 


o± y 


Pro 


Leu 










85 










90 










95 




Gly 


Glu 


Lys 


Val 


Asn 


Ala 


Ser 


Trp 


Glv 


He 


Ser 


Gly 


Asp 


Gly 


Lys 


Thr 








100 










105 










110 






Ala 


Phe 


He 


Glu 


Met 


Ala 


Ala 


Ala 


Ser 


Gly 


Leu 


Glu 


Leu 


Val 


Pro 


Ala 






115 










120 










125 








Glu 


Lys 


Arg 


Asp 


Pro 


Leu 


Val 


Thr 


Thr 


Ser 


Arg 


Gly 


ThT 
± ill. 


f;i \7 


Glu 


Leu 




130 










135 










140 








He 


Leu 


Gin 


Ala 


Leu 


Glu 


Ser 


Gly 


Ala 


Thr 


Asn 


He 


lie 


He 


Gly 


He 


145 










150 










155 








160 


Gly 


Gly 


Ser 


Ala 


Thr 


Asn 


Asp 


Gly 


Gly 


Ala 


Gly Met 


Val 


Gin 


Ala 


Leu 










165 










170 










175 




Gly 


Ala 


Lys 


Leu 


Cys 


Asp 


Ala 


Asn 


Gly 


Asn 


Glu 


He 


Gly 


Phe 


Gly 


Gly 








180 










185 










190 
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Gly 


Ser 


Leu 


Asn 


Thr 


Leu 


Asn 


Asp 


He 


Asp 


He 


Ser 


Glv 


Leu 


Asp 


Pro 






195 










200 










205 








Arg 


Leu 


Lys 


Asp 


Cys 


Val 


He 


Arg 


Val 


Ala 


Cvs 


Asp 


Val 


Thr 


Asn 


Pro 




210 










215 










220 










Leu 


Val 


Gly 


Asp 


Asn 


Gly 


Ala 


Ser 


Arg 


He 


Phe 


Glv 

V_) J_ 


Pro 


Gin 


Lys 


Gly 


225 










230 










235 










240 


Ala 


Ser 


Glu 


Ala 


Met 


He 


Val 


Glu 


Leu 


Asp 


Asn 


Asn 


Leu 


Ser 


His 












245 










250 










255 




Ala 


Glu 


Val 


He 


Lys 


Lys 


Ala 


Leu 


His 


Val 


Asp 


Val 


Lys 


Asp 


Val 


Pro 








260 










265 










270 






Gly Ala 


Gly 


Ala 


Ala 


Gly 


Gly 


Met 


Gly 
w J. 


Ala 


Ala 


Leu 


Met 


Ala 


Phe 


Leu 






275 










280 










285 








Gly Ala 


Glu 


Leu 


Lys 


Ser 


Gly 


He 


Glu 


He 


Val 


Thr 


Thr 


Ala 


Leu 


Asn 




290 










295 










300 










Leu 


Glu 


Glu 


His 


He 


His 


Asp 


Cys 


Thr 


Leu 


Val 


He 


Thr 


Gly 


Glu 


Gly 


305 










310 










315 












rli. y 


He 


Asp 


Ser 


Gin 


Ser 


He 


His 


Glv 
\_» J. J 


Lys 


Val 


Pro 


He 


Gly 


Val 


Ala 










325 










330 










-J -J O 




Asn 


Val 


Ala 


Lys 


Lys 


Tvr 


His 


Lys 


Pro 


Val 


He 


Gly 


He 


Ala 




Cor- 
O c: X 








340 










345 










■J w 






Leu 


Thr 


Asp 


Asp 


Val 


Gly 


Val 


Val 


His 


Gin 


His 


Gly 


He 




AT ^ 


V dX 






355 










360 










365 








Phe 


Ser 


Val 


Leu 


Thr 


Ser 


T 1 ^ 
-L _L 




1 ill. 


Leu 


fisp 


o±U 


/\_La 


irne 


Arg 


biy 




370 










375 










380 








Al Pi 


Tyr 


Asp 




J. j_ t: 




Jr\l. y 


Ala 


oer 


i-ir g 


Asn 


lie 


Ala 


Ala 


Thr 


Leu 


385 










o _? u 




















4 UU 


Ala 


He 


Gly 


Met 


Arg 


Asn 


Ala 


C^l \r 
o J. ^ 


























405 


























<210> 


256 




























<211> 


299 
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PRT 
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E. Coll 


























<400> 


256 


























Met 


He 


Asp 


Met 


Thr 


Met 


Lys 


Val 


Gly 


Phe 


He 


Gly 


XjC u. 


Gly 


He 




1 








5 










10 










1 R 

± D 




Gly 


Lys 


Pro 


Met 


Ser 


Lys 


Asn 


Leu 


Leu 


Lys 


Ala 


Gly 


i yr 


O t: J_ 


Leu 


V dX 








20 










25 
















Val 


Ala 


Asp 


Arg 


Asn 


Pro 


Glu 


Ala 


He 


Ala 


Asp 


Val 


He 


Ala 


Ala 


Gly 






35 










40 










45 






Ala 


Glu 


Thr 


Ala 


Ser 


Thr 


Ala 


Lvs 


Ala 


He 


Ala 


Glu 


Gin 




Asp 


Val 




50 










55 










60 






He 


He 


Thr 


Met 


Leu 


Pro 


Asn 


Ser 


Pro 


His 


Val 


Lys 


Glu 


Val 


Ala 


Leu 


65 










70 










75 










80 


Gly 


Glu 


Asn 


Gly 


He 


He 


Glu 


Gly 


Ala 


Lys 


Pro 


Gly 


Thr 


Val 


Leu 


He 










85 










90 










95 




Asp 


Met 


Ser 


Ser 


He 


Ala 


Pro 


Leu 


Ala 


Ser 


Arg 


Glu 


He 


Ser 


Glu 


Ala 








100 










105 










110 






Leu 


Lys 


Ala 


Lys 


Gly 


He 


Asp 


Met 


Leu 


Asp 


Ala 


Pro 


Val 


Ser 


Gly Gly 






115 










120 










125 








Glu 


Pro 


Lys 


Ala 


He 


Asp 


Gly 


Thr 


Leu 


Ser 


Val 


Met 


Val 


Gly 


Gly Asp 




130 










135 










140 










Lys 


Ala 


He 


Phe 


Asp 


Lys 


Tyr 


Tyr 


Asp 


Leu 


Met 


Lys 


Ala 


Met 


Ala 


Gly 


145 










150 










155 










160 


Ser 


Val 


Val 


His 


Thr 


Gly 


Glu 


He 


Gly 


Ala 


Gly 


Asn 


Val 


Thr 


Lys 


Leu 



165 170 175 

Ala Asn Gin Val He Val Ala Leu Asn He Ala Ala Met Ser Glu Ala 

180 185 190 

Leu Thr Leu Ala Thr Lys Ala Gly Val Asn Pro Asp Leu Val Tyr Gin 

195 200 205 

Ala He Arg Gly Gly Leu Ala Gly Ser Thr Val Leu Asp Ala Lys Ala 

210 215 220 

Pro Met Val Met Asp Arg Asn Phe Lys Pro Gly Phe Arg He Asp Leu 
225 230 235 240 

His He Lys Asp Leu Ala Asn Ala Leu Asp Thr Ser His Gly Val Gly 

245 250 255 

Ala Gin Leu Pro Leu Thr Ala Ala Val Met Glu Met Met Gin Ala Leu 

260 265 270 

Arg Ala Asp Gly Leu Gly Thr Ala Asp His Ser Ala Leu Ala Cys Tyr 

275 280 285 

Tyr Glu Lys Leu Ala Lys Val Glu Val Thr Arg 
290 295 



<210> 257 

<211> 256 

<212> PRT 

<213> E. Coli 

<400> 257 

Met Asn Asn Asp Val Phe Pro Asn Lys Phe Lys Ala Ala Leu Ala Ala 

15 10 15 

Lys Gin Val Gin He Gly Cys Trp Ser Ala Leu Ser Asn Pro He Ser 

20 25 30 

Thr Glu Val Leu Gly Leu Ala Gly Phe Asp Trp Leu Val Leu Asp Gly 

35 40 45 

Glu His Ala Pro Asn Asp He Ser Thr Phe He Pro Gin Leu Met Ala 

50 55 60 

Leu Lys Gly Ser Ala Ser Ala Pro Val Val Arg Val Pro Thr Asn Glu 
65 70 75 80 

Pro Val He He Lys Arg Leu Leu Asp He Gly Phe Tyr Asn Phe Leu 

85 90 95 

He Pro Phe Val Glu Thr Lys Glu Glu Ala Glu Leu Ala Val Ala Ser 

100 105 110 

Thr Arg Tyr Pro Pro Glu Gly He Arg Gly Val Ser Val Ser His Arg 

115 120 125 

Ala Asn Met Phe Gly Thr Val Ala Asp Tyr Phe Ala Gin Ser Asn Lys 

130 135 140 

Asn He Thr He Leu Val Gin He Glu Ser Gin Gin Gly Val Asp Asn 
145 150 155 160 

Val Asp Ala He Ala Ala Thr Glu Gly Val Asp Gly He Phe Val Gly 

165 170 175 

Pro Ser Asp Leu Ala Ala Ala Leu Gly His Leu Gly Asn Ala Ser His 

180 185 190 

Pro Asp Val Gin Lys Ala He Gin His He Phe Asn Arg Ala Ser Ala 

195 200 205 

His Gly Lys Pro Ser Gly He Leu Ala Pro Val Glu Ala Asp Ala Arg 

210 215 220 

Arg Tyr Leu Glu Trp Gly Ala Thr Phe Val Ala Val Gly Ser Asp Leu 
225 230 235 240 

Gly Val Phe Arg Ser Ala Thr Gin Lys Leu Ala Asp Thr Phe Lys Lys 
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245 



250 



255 



<210> 258 
<211> 444 
<212> PRT 
<213> E. Coli 



<400> 258 
Met lie Leu Asp 
1 

Tyr Leu lie Leu 
20 

Asp Arg Ala Thr 
35 

Gin Leu Ser Ala 
50 

Ala Tyr Leu Leu 
65 

Gly Ser Lys Lys 

Thr Phe Leu Gin 
100 

lie Ser Met Phe 
115 

Ser Phe Pro Ala 
130 

Glu Arg Gly Thr 
145 

Leu Ala Leu Phe 

Trp Glu His Val 
180 

Ala Leu Trp lie 
195 

Ser Ala Glu Glu 
210 

Met Asp His Lys 
225 

His Tyr He Lys 

Phe Gly Gin Tyr 
260 

Phe Pro He Tyr 
275 

Gly Leu Val Ala 
290 

Leu Gly Gly Val 
305 

Thr Leu Ala Arg 

Thr He He Leu 
340 

Leu Met Ala Leu 
355 

Pro Val He Ser 
370 



Thr Val Asp Glu 
5 

Leu He He Phe 

Leu Ser He Ala 
40 

Val Ser Met Gly 
55 

Met Gin He Pro 
70 

Val Tyr Thr Tyr 
85 

Gly Phe Val Asp 

Phe Met Arg Phe 
120 

Asn Ala Arg He 
135 

Ala Ser Ala He 
150 

Ser Pro Leu Leu 
165 

Phe Thr Val Met 

Lys Leu He His 
200 

Leu Lys Phe He 
215 

Lys Pro Gly Ser 
230 

Gin Leu Leu Ser 
245 

Phe He Asn Thr 

Leu Val Gin Glu 
280 

Ser He Pro Ala 
295 

Phe Ser Asp Tyr 
310 

Lys Leu Pro He 
325 

Cys Asn Tyr Thr 

Ala Phe Phe Gly 
360 

Asp Thr Ala Pro 
375 



Lys Lys Lys Gly 
10 

He Val Thr Ala 

25 

Gly Thr Glu Val 

Tyr He Phe Ser 
60 

Gly Gly Trp Leu 
75 

Ser Leu Phe Phe 
90 

Met Phe Pro Leu 
105 

Met Leu Gly Phe 

Val Ala Ala Trp 
140 

Phe Asn Ser Ala 
155 

Gly Trp Leu Thr 
170 

Gly Val He Gly 
185 

Asn Pro Thr Asp 

Ser Glu Asn Gly 
220 

Ala Ala Ala Ser 
235 

Asn Arg Met Met 
250 

He Thr Trp Phe 
265 

Lys Gly Met Ser 

Leu Cys Gly Phe 
300 

Leu He Lys Arg 
315 

Val Leu Gly Met 
330 

Asn Asn Thr Thr 
345 

Lys Gly Phe Gly 

Lys Glu He Val 
380 



val 


nis 


Thr 


Arg 






15 




Val 


Asn 


Tyr 


Ala 




o u 






Ala 


Lys 


Glu 


Leu 










Ala 


Phe 


Gly 


Trp 


Leu 


Asp 


Lys 


Phe 








80 


Trp 


Ser 


Leu 


Phe 






95 




Ala 


Trp 


Ala 


Gly 




IIU 






Ser 


Glu 


Ala 


Pro 


IOC 

125 








Phe 


Pre 


Thr 


Lys 


Gin 


Tyr 


Phe 


Ser 








160 


Phe 


Ala 


Trp 


Gly 






175 




Phe 


Val 


Leu 


Thr 




1 yu 






His 


Pro 


Arg 


Met 


o ri c 








Ala 


Val 


Val 


Asp 


Gly 


Pro 


Lys 


Leu 








240 


Leu 


Gly 


Val 


Phe 






255 




Phe 


Leu 


Thr 


Trp 




270 






He 


Leu 


Lys 


Val 


285 








Ala 


Gly 


Gly Val 


Gly 


Leu 


Ser 


Leu 








320 


Leu 


Leu 


Ala 


Ser 






335 




Leu 


Val 


Val 


Met 




350 






Ala 


Leu 


Gly 


Trp 


365 








Gly 


Leu 


Cys 


Gly 
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Gly Val Phe Asn Val Phe Gly Asn Val Ala Ser He Val Thr Pro Leu 
385 390 395 400 

Val He Gly Tyr Leu Val Ser Glu Leu His Ser Phe Asn Ala Ala Leu 

405 410 415 

Val Phe Val Gly Cys Ser Ala Leu Met Ala Met Val Cys Tyr Leu Phe 

420 425 430 

Val Val Gly Asp He Lys Arg Met Glu Leu Gin Lys 
435 440 



<210> 
<211> 
<212> 
<213> 



259 
511 
PRT 

E. Coli 



Met 
1 

Val 

Thr 

Gly 

Ser 

65 

Val 

Leu 

Lys 

Gly 

Asp 
145 
He 

Glu 

He 

Arg 

Ser 
225 
Arg 

Gly 

Asn 

Gly 

Leu 
305 
Met 



<400> 
Gin Thr 

Tyr Lys 

Leu His 

35 
Lys Ser 
50 

Gly Thr 

His Ala 

Phe Pro 

Lys Gin 
115 
Cys Gin 
130 

Arg Gin 

Leu He 

Arg Leu 

Val Phe 
195 
He Ser 
210 

Glu Leu 

Glu Lys 

Asn Arg 

Leu Thr 
275 
Glu He 
290 

Ala Glu 
Leu Asn 



259 

Ser Asp Thr 
5 

Gin Tyr Ser 
20 

Gin Gly Glu 

Thr Leu Met 

Leu Glu He 
70 

His Gin Leu 
85 

Ser Leu Ser 
100 

Leu Ser Met 

Phe Asp Leu 

Met Val Glu 
150 

Leu Asp Glu 
165 

Phe Ser Arg 
180 

He Ser His 

Val Met Arg 

Ser Thr Asp 
230 

Ser Leu Ser 

245 
Pro Gin His 
260 

Gly Glu Gly 

Leu Gly Leu 

Thr Leu Tyr 
310 

Gly Lys Glu 



Arg Ala 

Gly Val 

Val His 
40 

Lys He 
55 

Glu Gly 

Gly He 

He Lys 

Gin Lys 
120 
His Ser 
135 

He Leu 

Pro Thr 

Leu Gin 

Lys Leu 
200 
Asp Gly 
215 

Asp He 

Ala Ser 

Ala Ala 

Phe Arg 
280 
Ala Gly 
295 

Gly Leu 



Leu Pro 
10 

Asn Val 
25 

Ala Leu 

He Ala 

Asn Asn 

Tyr Leu 
90 

Glu Asn 
105 

Met Lys 

Leu Ala 

Arg Gly 

Ala Ser 
170 
Glu Leu 
185 

Pro Glu 

Thr He 

He Gin 

Gin Lys 
250 
Gly Thr 
265 

Asn Val 



Leu Val 
Arg Thr 
He Asn Lys Leu 



Leu 

Leu 

Leu 

Gly 

Tyr 

75 

Val 

He 

Asn 

Gly 

Leu 
155 
Leu 

Leu 

He 

Ala 

Ala 
235 
Leu 

Pro 

Ser 

Gly 

Leu 
315 
Ser 



Leu Cys Ala 
Lys 
Gly 



He 

60 

Val 

Pro 

Leu 

Leu 

Ser 
140 
Met 

Thr 

Ala 

Arg 

Leu 
220 
He 

Trp 

Val 

Leu 

Ala 

300 
Arg 

Thr 



Gly He 

30 
Gly Asn 
45 

Thr Pro 



Arg Leu 

Gin Glu 

Phe Gly 
110 
Leu Ala 
125 

Leu Asp 

Arg Asp 

Pro Ala 

Thr Gly 
190 
Gin He 
205 

Ser Gly 

Thr Pro 

Leu Glu 

Leu Thr 
270 
Thr Leu 
285 

Gly Arg 
Gly Gly 
Gly Glu 



Arg Ser 
15 

Asp Phe 

Gly Ala 

Ala Asp 

Thr Pro 

80 
Pro Leu 
95 

Leu Ala 

Ala Leu 

Val Ala 

Ser Arg 
160 
Glu Thr 
175 

Val Gly 

Ala Asp 

Lys Thr 

Ala Val 
240 
Leu Pro 
255 

Leu Glu 

Asn Ala 

Thr Glu 

Arg He 
320 
Arg Leu 



-104- 



Leu Arg 

Asn Leu 

Leu Arg 
370 
Arg Tyr 
385 

Ala Arg 

Cys Leu 

Gly Val 

lie Ala 
450 
Glu He 
465 

He Thr 
Met Arg 



325 

Gly Leu Val 

340 
Asp Ala Ser 
355 

Gly Phe Trp 

Arg Arg Ala 

Thr Leu Ser 
405 

Glu Ala Ser 
420 

Asp Val Ser 
435 

Ala Gin Asn 

Glu Leu Met 

His Ser Ala 
485 

Val Ala Phe 
500 



Tyr Leu 

Leu Ala 

Ala Lys 
375 
Leu Asn 
390 

Gly Gly 

Pro Gin 

Ala Arg 

Val Ala 
455 
Ala Asp 
470 

Leu Thr 



Pro 

Trp 
360 
Thr 

He 

Asn 

Val 

Asn 
440 
Val 



Arg 
Glu 
Gly Asp Ser 



330 
Glu Asp 
345 

Asn Val 

Ala Lys 

Lys Phe 

Gin Gin 
410 
Leu He 
425 

Asp He 

Leu Leu 

Val Tyr 

Arg Asp 
490 
Gin Arg 
505 



Arg 

Cys 

Asp 

Asn 
395 
Lys 

Val 

Tyr 

He 

Val 
475 
He 

Gin 



Gin 

Ala 

Asn 
380 
Gin 

He 

Asp 

Gin 

Ser 
460 
Met 

Asn 

Glu 



Ser Ser 
350 
Leu Thr 
365 

Ala Thr 

Pro Glu 

Leu He 

Glu Pro 
430 
Leu Leu 
445 

Ser Asp 

His Gin 

Val Glu 

Ala Ser 
510 



335 

Gly Leu 

His Asn 

Leu Glu 

Gin Ala 
400 
Ala Lys 
415 

Thr Arg 

Arg Ser 

Leu Glu 

Gly Glu 
480 
Thr He 
495 
Cys 



<210> 
<211> 
<212> 
<213> 



260 
342 
PRT 

E. Coll 



Met 
1 

Val 

Ser 

Leu 

Val 

65 

Leu 

Leu 

Leu 

Arg 

Pro 
145 
Ala 

Leu 

Leu 



<400> 
Leu Lys 

Val Leu 

Val Gin 
35 

Leu Ala 
50 

Ser Val 

Leu Asn 

Leu Gly 

Lys He 
115 
Gly He 
130 

Ala Glu 

He Gly 

Ala Lys 

Gin Gly 
195 



260 

Phe He 
5 

Leu Phe 
20 

Thr Leu 

Met Gly 

Gly Ser 

Ala Gly 

85 
Leu Leu 
100 

Pro Ala 

Met Leu 

Leu Lys 

Trp Leu 
165 
Thr Ala 
180 

Ala Arg 



Gin 

Val 

Thr 

Ala 

He 

70 

Tyr 

Ala 

He 

Leu 

Gin 
150 
Thr 

Phe 

Gin 



Asn Asn 

Leu Pro 

Met Val 
40 

Thr Leu 
55 

Thr Gly 

Ser Leu 

Gly Phe 

Val Ala 
120 
Trp Thr 
135 

Leu Ser 

He He 

Gly Arg 

Leu Gly 
200 



Arg Glu 
10 

Gly Phe 
25 

Tyr Ser 

Val Met 

Met Cys 

Pro Val 
90 

Phe Asn 
105 

Thr Leu 

Gly Gly 

Ala Pro 

Leu Val 
170 
Ser Phe 
185 

Val Arg 



He 

Leu 

Ser 

Leu 

Ala 

75 

Ala 

Gly 

Gly 

Lys 

Leu 
155 
Ala 

Tyr 

Thr 



Thr Ala 

Asp Arg 

Ala Gin 
45 

Thr Arg 
60 

Val Leu 

Cys Val 

Val Leu 

Thr Leu 
125 
Trp He 
140 

Leu Leu 

Phe Met 

Ala Thr 

Glu Ala 
205 



Leu Leu Ala 
15 

Gin Tyr Leu 
30 

He Leu He 

Asn He Asp 

Leu Gly Met 
80 

Ala Thr Leu 
95 

Val Ala Trp 
110 

Gly Leu Tyr 

Glu Gly Leu 

Gly Val Ser 
160 

Ala Trp Leu 
175 

Gly Asp Asn 
190 

He Arg He 
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Val 


Ala 


Phe 


Ser 


Leu 


Asn 


Gly 


Cys 


Met 


Ala 


Ala 


Leu 


Ala 


Glv 


He 


Val 




210 










215 










220 










Phe 


Ala 


Ser 


Gin 


He 


Gly 


Phe 


He 


Pro 


Asn 


Gin 


Thr 


Gly 


Thr 


Glv 


Leu 


225 










230 










235 










240 


Glu 


Met 


Lys 


Ala 


He 


Ala 


Ala 


Cvs 


Val 


Leu 


Gly Gly 


He 


Ser 


Leu 


Leu 










245 










250 










255 




Gly 


Gly 


Ser 


Gly 


Ala 


He 


He 


Gly 


Ala 


Val 


Leu 


Gly Ala 


Trp 


Phe 


Leu 








260 










265 










270 






Thr 


Gin 


He 


Asp 


Ser 


Val 


Leu 


Val 


Leu 


Leu 


JrxJL y 


He 


Pro 


Ala 


Tro 


Tro 






275 










280 










285 








Asn 


Asp 


Phe 


He 


Ala 


Glv 


Leu 


Val 


Leu 


Leu 


Ala 


Val 


Leu 


Val 


Phe 


Asp 




290 










295 










300 








Gly 


Arg 


Leu 


Arg 


Cys 


Ala 


Leu 


Glu 


Arg 


Asn 


Leu 


Arg Arg 


Gin 


Lys 


Tyr 


305 










310 




















J z u 


Ala 


Arg 


Phe 


Met 


Thr 


Pro 


Pro 


Pro 


Ser 


Val 


Lys 


Pro 


Ala 


O t; J. 


Oci 


o JLy 










325 










330 










O O -J 




Lys 


Lys 


Arg 


Glu 


Ala 


Ala 




























340 




























<210> 


261 




























<211> 


330 




























<212> 


PRT 




























<213> 


E. Coll 


























<400> 


261 


























Met 


Arg 


He 


Arg 


Tyr 


Gly 


TrD 


Glu 


Leu 


Ala 


Leu 


Ala 


Ala 


-i-l t- KJL 




V d-L 


1 








5 










10 










1 R 




lie 


Glu 


He 


Val 


Ala 


Phe 


Gly 


Ala 


He 


Asn 


Pro 


Arg 


Met 


T ,on 
J-Jt; u. 




Leu 








20 










25 
















Asn 


Met 


Leu 


Leu 


Phe 


Ser 


Thr 


Ser 


Asp 


Phe 


He 


Cys 


He 


\j-L y 


J. X tr 


V3 1 
V dJ. 






35 










40 










45 








Ala 


Leu 


Pro 


Leu 


Thr 


Met 


Val 


He 


Val 


Ser 


Gly Gly 


He 


Asp 


-L -L C 


o t; i. 




50 










55 










60 










Phe 


Gly 


Ser 


Thr 


He 


Glv 


Leu 


Cys 


Ala 


He 


Ala 


Leu 


Gly 


Val 


Leu 


Phe 


65 










70 










75 








O w 


Gin 


Ser 


Gly 


Val 


Pro 


Met 


Pro 


Leu 


Ala 


He 


Leu 


Leu 


Thr 


Leu 


Leu 


Leu 










85 










90 










J -J 




Gly 


Ala 


Leu 


Cys 


Gly 


Leu 


He 


Asn 


Ala 


Gly 


Leu 


He 


He 


J. y ± 


Thr 


±jy o 








100 










105 










110 






Val 


Asn 


Pro 


Leu 


Val 


He 


Thr 


Leu 


Gly 


Thr 


Leu 


Tyr 


Leu 


Phe 




y 






115 










120 










125 






Ser 


Ala 


Leu 


Leu 


Leu 


Ser 


Glv 


Met 


Ala 


Gly 


Ala 


Thr 


Gly 


1 y j_ 


Glu 


ox y 




130 










135 










140 










lie 


Gly 


Gly 


Phe 


Pro 


Met 


Ala 


Phe 


Thr 


Asp 


Phe 


Ala 


Asn 


j-ieu 




V ctX 


145 










150 










155 








± D U 


Leu 


Gly 


Leu 


Pro 


Val 


Pro 


Leu 


He 


He 


Phe 


Leu 


He 


Cys 


Leu 


Leu 


V;=i 1 

V Ct X 










165 










170 










1 7 S 




Phe 


Trp 


Leu 


Trp 


Leu 


His 


Lvs 


Thr 


His 


Ala 


Gly Arg Asn 


Val 


Phe 


Leu 








180 










185 










190 






He 


Gly 


Gin 


Ser 


Pro 


Arq 


Val 


Ala 


Leu 


Tvr 

x _y j_ 


Ser 


Ala 


He 


Pro 


Val 


Asn 






195 










200 










205 








Arg 


Thr 


Leu 


Cys 


Ala 


Leu 


Tyr 


Ala 


Met 


Thr 


Gly Leu Ala 


Ser 


Ala 


Val 




210 










215 










220 










Ala 


Ala 


Val 


Leu 


Leu 


Val 


Ser 


Tyr 


Phe 


Gly 


Ser 


Ala 


Arg 


Ser 


Asp 


Leu 


225 










230 










235 








240 


Gly 


Ala 


Ser 


Phe 


Leu 


Met 


Pro 


Ala 


He 


Thr 


Ala 


Val 


Val 


Leu 


Gly 


Gly 










245 










250 










255 



Ala Asn He Tyr Gly Gly Ser Gly Ser He He Gly Thr Ala He Ala 

260 265 270 

Val Leu Leu Val Gly Tyr Leu Gin Gin Gly Leu Gin Met Ala Gly Val 

275 280 285 

Pro Asn Gin Val Ser Ser Ala Leu Ser Gly Ala Leu Leu He Val Val 

290 295 300 

Val Val Gly Arg Ser Val Ser Leu His Arg Gin Gin He Lys Glu Trp 
305 310 315 320 

Leu Ala Arg Arg Ala Asn Asn Pro Leu Pro 
325 330 



<210> 262 
<211> 340 
<212> PRT 
<213> E. Coli 

<400> 262 

Met Thr Leu His Arg Phe Lys Lys He Ala Leu Leu Ser Ala Leu Gly 

15 10 15 

He Ala Ala He Ser Met Asn Val Gin Ala Ala Glu Arg He Ala Phe 

20 25 30 

He Pro Lys Leu Val Gly Val Gly Phe Phe Thr Ser Gly Gly Asn Gly 

35 40 45 

Ala Gin Gin Ala Gly Lys Glu Leu Gly Val Asp Val Thr Tyr Asp Gly 

50 55 60 

Pro Thr Glu Pro Ser Val Ser Gly Gin Val Gin Leu He Asn Asn Phe 
65 70 75 80 

Val Asn Gin Gly Tyr Asn Ala He He Val Ser Ala Val Ser Pro Asp 

85 90 95 

Gly Leu Cys Pro Ala Leu Lys Arg Ala Met Gin Arg Gly Val Arg Val 

100 105 110 

Leu Thr Trp Asp Ser Asp Thr Lys Pro Glu Cys Arg Ser Tyr Tyr He 

115 120 125 

Asn Gin Gly Thr Pro Ala Gin Leu Gly Gly Met Leu Val Asp Met Ala 

130 135 140 

Ala Arg Gin Val Asn Lys Asp Lys Ala Lys Val Ala Phe Phe Tyr Ser 
145 150 155 160 

Ser Pro Thr Val Thr Asp Gin Asn Gin Trp Val Lys Glu Ala Lys Ala 

165 170 175 

Lys He Ala Lys Glu His Pro Gly Trp Glu He Val Thr Thr Gin Phe 

180 185 190 

Gly Tyr Asn Asp Ala Thr Lys Ser Leu Gin Thr Ala Glu Gly He Leu 

195 200 205 

Lys Ala Tyr Ser Asp Leu Asp Ala He He Ala Pro Asp Ala Asn Ala 

210 215 220 

Leu Pro Ala Ala Ala Gin Ala Ala Glu Asn Leu Lys Asn Asp Lys Val 
225 230 235 240 

Ala He Val Gly Phe Ser Thr Pro Asn Val Met Arg Pro Tyr Val Glu 

245 250 255 

Arg Gly Thr Val Lys Glu Phe Gly Leu Trp Asp Val Val Gin Gin Gly 

260 265 270 

Lys He Ser Val Tyr Val Ala Asp Ala Leu Leu Lys Lys Gly Ser Met 

275 280 285 

Lys Thr Gly Asp Lys Leu Asp He Lys Gly Val Gly Gin Val Glu Val 
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290 



295 



300 



Ser Pro Asn Ser Val Gin Gly Tyr Asp Tyr Glu Ala Asp Gly Asn Gly 
305 310 315 320 

lie Val Leu Leu Pro Glu Arg Val lie Phe Asn Lys Glu Asn lie Gly 
325 330 335 

Lys Tyr Asp Phe 
340 



<210> 263 
<211> 291 
<212> PRT 
<213> E. Coli 

<400> 263 

Met Ala Asp Leu Asp Asp He Lys Asp Gly Lys Asp Phe Arg Thr Asp 

15 10 15 

Gin Pro Gin Lys Asn He Pro Phe Thr Leu Lys Gly Cys Gly Ala Leu 

20 25 30 

Asp Trp Gly Met Gin Ser Arg Leu Ser Arg He Phe Asn Pro Lys Thr 

35 40 45 

Gly Lys Thr Val Met Leu Ala Phe Asp His Gly Tyr Phe Gin Gly Pro 

50 55 60 

Thr Thr Gly Leu Glu Arg He Asp He Asn He Ala Pro Leu Phe Glu 
65 70 75 80 

His Ala Asp Val Leu Met Cys Thr Arg Gly He Leu Arg Ser Val Val 

85 90 95 

Pro Pro Ala Thr Asn Arg Pro Val Val Leu Arg Ala Ser Gly Ala Asn 

100 105 110 

Ser He Leu Ala Glu Leu Ser Asn Glu Ala Val Ala Leu Ser Met Asp 

115 120 125 

Asp Ala Val Arg Leu Asn Ser Cys Ala Val Ala Ala Gin Val Tyr He 

130 135 140 

Gly Ser Glu Tyr Glu His Gin Ser He Lys Asn He He Gin Leu Val 
145 150 155 . 160 

Asp Ala Gly Met Lys Val Gly Met Pro Thr Met Ala Val Thr Gly Val 

165 170 175 

Gly Lys Asp Met Val Arg Asp Gin Arg Tyr Phe Ser Leu Ala Thr Arg 

180 185 190 

He Ala Ala Glu Met Gly Ala Gin He He Lys Thr Tyr Tyr Val Glu 

195 200 205 

Lys Gly Phe Glu Arg He Val Ala Gly Cys Pro Val Pro He Val He 

210 215 220 

Ala Gly Gly Lys Lys Leu Pro Glu Arg Glu Ala Leu Glu Met Cys Trp 
225 230 235 240 

Gin Ala He Asp Gin Gly Ala Ser Gly Val Asp Met Gly Arg Asn He 

245 250 255 

Phe Gin Ser Asp His Pro Val Ala Met Met Lys Ala Val Gin Ala Val 

260 265 270 

Val His His Asn Glu Thr Ala Asp Arg Ala Tyr Glu Leu Tyr Leu Ser 

275 280 285 

Glu Lys Gin 



290 



<210> 
<211> 



264 
96 
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<212> PRT 
<213> E. Coli 



<400> 264 
Met His Val Thr 
1 

Glu Phe He Glu 
20 

Glu Gly Asn Leu 
35 

Arg Phe Tyr He 
50 

His Lys Thr Thr 
65 

Leu Met Thr Gly 



Leu Val Glu He 
5 

Val Phe Arg Gin 

Arg Phe Asp Val 
40 

Tyr Glu Ala Tyr 
55 

Pro His Tyr Lys 
70 

Pro Arg Lys Lys 
85 



Asn Val His Glu 
10 

Asn His Leu Gly 
25 

Leu Gin Asp Pro 

Lys Asp Glu Asp 
60 

Thr Cys Val Ala 
75 

Arg Leu Phe Asn 
90 



Asp Lys Val Asp 
15 

Ser Val Gin Glu 
30 

Glu Val Asn Ser 
45 

Ala Val Ala Phe 

Lys Leu Glu Ser 
80 

Gly Leu Met Pro 
95 



<210> 265 
<211> 383 
<212> PRT 
<213> E. Coli 



<400> 265 



Met 


Phe 


\3l. U 


Jrr o 


IVfQ-f- 


Sj±U 


-Ueu inr 


Asn 


Asp 


Ala 


Val 


He 


Lys 


Val 


He 


1 








o 








10 










15 




Gly Val 


Gly 


Gly 


Gly 


Gly 




Ala 


V dJ- 


\J-L U. 






vaj- 


Arg 


Lj-LU 








20 








25 










30 




Arg 


He 


Glu 


Gly 


Val 


Glu 


Phe Phe 


Ala 


Val 


Asn 


Thr 


Asp 


Ala 


Gin 


Ala 






35 








40 










45 








Leu 


Arg 


Lys 


Thr 


Ala 


Val 


Gly Gin 


Thr 


He 


Gin 


He 


Gly 


Ser 


Gly 


He 




50 










55 








60 








Thr 


Lys 


Gly 


Leu 


Gly 


Ala 


Gly Ala 


Asn 


Pro 


Glu 


Val 


Gly 


Arg 


Asn 


Ala 


65 










70 








75 








80 


Ala 


Asp 


Glu 


Asp 


Arg 


Asp 


Ala Leu 


Arg 


Ala 


Ala 


Leu 


Glu 


Gly 


Ala 


Asp 










85 








90 










95 




Met 


Val 


Phe 


He 


Ala 


Ala 


Gly Met 


Gly 


Gly 


Gly 


Thr 


Gly 


Thr 


Gly 


Ala 








100 








105 










110 






Ala 


Pro 


Val 


Val 


Ala 


Glu 


Val Ala 


Lys 


Asp 


Leu 


Gly 


He 


Leu 


Thr 


Val 






115 








120 










125 








Ala 


Val 


Val 


Thr 


Lys 


Pro 


Phe Asn 


Phe 


Glu 


Gly 


Lys 


Lys 


Arg 


Met 


Ala 




130 










135 








140 










Phe 


Ala 


Glu 


Gin 


Gly 


He 


Thr Glu 


Leu 


Ser 


Lys 


His 


Val 


Asp 


Ser 


Leu 


145 










150 








155 










160 


He 


Thr 


He 


Pro 


Asn 


Asp 


Lys Leu 


Leu 


Lys 


Val 


Leu 


Gly 


Arg 


Gly 


He 










165 








170 










175 




Ser 


Leu 


Leu 


Asp 


Ala 


Phe 


Gly Ala 


Ala 


Asn 


Asp 


Val 


Leu 


Lys 


Gly 


Ala 








180 








185 










190 






Val 


Gin 


Gly 


He 


Ala 


Glu 


Leu He 


Thr 


Arg 


Pro 


Gly 


Leu 


Met 


Asn 


Val 






195 








200 










205 








Asp 


Phe 


Ala 


Asp 


Val 


Arg 


Thr Val 


Met 


Ser 


Glu 


Met 


Gly 


Tyr 


Ala 


Met 




210 










215 








220 










Met 


Gly 


Ser 


Gly 


Val 


Ala 


Ser Gly 


Glu 


Asp 


Arg 


Ala 


Glu 


Glu 


Ala 


Ala 


225 










230 








235 










240 


Glu 


Met 


Ala 


He 


Ser 


Ser 


Pro Leu 


Leu 


Glu 


Asp 


He 


Asp 


Leu 


Ser 


Gly 










245 








250 










255 


Ala 


Arg 


Gly 


Val 


Leu 


Val 


Asn He 


Thr 


Ala 


Gly 


Phe 


Asp 


Leu 


Arg 


Leu 
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260 

Asp Glu Phe Giu 
275 

Asn Ala Thr Vai 
290 

Glu Leu Arg Val 
305 

Pro Glu lie Thr 

Asp Arg Tyr Gin 
340 

Pro Val Ala Lys 
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-116- 



Ala Met 

Glu Tyr 
210 
Ala Arg 
225 

Cys lie 

Leu Val 

He Thr 

Glu He 
290 
Tyr Arg 
305 

He Met 

Asp Glu 

Leu Ser 

Gin Asp 
370 
Pro Gly 
385 

Gly He 
Asn Glu 
Ala Leu 



Ala 
195 
His 

Tyr 

Leu 

Leu 

Arg 
275 
Tyr 

Pro 

Val 

Leu 

His 
355 
Arg 

Leu 

Gly 

Asn 

Met 
435 



180 
His 

Leu 

Pro 

His 

He 
260 
Thr 

Asp 

Gly 

Ser 

He 
340 
Trp 

Ser 

Tyr 

He 

Leu 
420 
Val 



Thr Arg 

Glu Gly 

Ser Tyr 
230 
Tyr Thr 
245 

Asp Ala 

Phe Pro 

He Val 

Thr Ser 
310 
Gly Leu 
325 

Ala Gin 

Leu Gly 

Arg He 

He Ala 
390 
Arg He 
405 

Thr Ala 
Ala Ala 



Ala Met 
200 
Glu He 
215 

Asn Thr 

Glu Asn 

Gly Cys 

Val Asn 
280 
Leu Glu 
295 

He Leu 

Val Lys 

Asn Ala 

Leu Asp 
360 
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He Leu Leu Gly He Val Gly Ala 
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25 

Pro Arg Leu Ser 

Asp Glu His Ser 
60 

Gin Phe Ala Val 
75 

Thr Asp Asp Thr 
90 

Leu Met Glu Gly 
105 



Ser 


Thr 


Pro 


Phe 






15 




Arg 


His 


Leu 


Asp 




30 






Val 


Met 


His 


Gly 


45 








Ala 


Glu 


Pro 


Asp 


Phe 


Pro 


Pro 


Glu 








80 


Tyr 


Phe 


Asn 


He 






95 




Ala 


Gin 


Gin 


Arg 




110 







<210> 280 
<211> 246 
<212> PRT 
<213> E. Coli 



<400> 280 

Met Lys Phe Lys Val He Ala Leu Ala Ala Leu Met Gly He Ser Gly 

15 10 15 

Met Ala Ala Gin Ala Asn Glu Leu Pro Asp Gly Pro His He Val Thr 

20 25 30 

Ser Gly Thr Ala Ser Val Asp Ala Val Pro Asp He Ala Thr Leu Ala 

35 40 45 

He Glu Val Asn Val Ala Ala Lys Asp Ala Ala Thr Ala Lys Lys Gin 

50 55 60 

Ala Asp Glu Arg Val Ala Gin Tyr He Ser Phe Leu Glu Leu Asn Gin 
65 70 75 80 

He Ala Lys Lys Asp He Ser Ser Ala Asn Leu Arg Thr Gin Pro Asp 

85 90 95 

Tyr Asp Tyr Gin Asp Gly Lys Ser He Leu Lys Gly Tyr Arg Ala Val 
100 105 110 
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Arg 


Thr 


Val 


Glu 


Val 


Thr 


Leu 


Arg 


Gin 


Leu 


Asp 


Lvs 


Leu 


Asn 


Ser 


Leu 




115 










120 










125 








Leu Asp 


Gly Ala 


Leu 


Lys 


Ala 


Gly Leu Asn 


Glu 


He 


Arg 


Ser 


Val 


Ser 




130 










135 










140 










Leu 




V ci _L 


Ala 


Gin 


Pro 


Asp 


Ala 


Tvr 


Lys 


Asp 


Lvs 


Ala 


Arg 


Lys 


Ala 












150 










155 










160 




T 1 O 




ii 


Ala 


He 


His 


Gin 


Ala 


Gin 


Glu 


Leu 


Ala 


Asn 


Gly 


Phe 








165 










170 










175 




His 


Arg 


Lys 


Leu 
180 


Gly 


Pro 


Val 


Tyr 


Ser 
185 


Val 


Arg 


Tvr 


His 


Val 
190 


Ser 


Asn 


Tyr 


Gin 


Pro 


Ser 


Pro 


Met 


Val 


Arg 


Met 


Met 


Lys 


Ala 


Asp 


Ala 


Ala 


Pro 




195 










200 










205 








Val 


Ser 
210 


Ala 


Gin 


Glu 


Thr 


Tyr 
215 


Glu 


Gin 


Ala 


Ala 


He 
220 


Gin 


Phe 


Asp 


Asp 


Gin 


Val 


Asp 


Val 


Val 


Phe 


Gin 


Leu 


Glu 


Pro 


Val 


Asp 


Gin 


Gin 


Pro 


Ala 


225 










230 










235 










240 


Lys 


Thr 


Pro 


Ala 


Ala 


Gin 























245 



<210> 281 
<211> 464 
<212> PRT 
<213> E. Coll 





<400> 


281 


























Met 


Leu 


Leu 


Leu 


Asp 


Ala 


Cys 


Ser 


Gin 


Met 


Cys 


Pro 


Ser 


Phe 


Arg 


Arg 


1 








5 










10 










15 




Phe 


Gin 


Thr 


Val 


Phe 


His 


Asn 


Ser 


Ser 


He 


Phe 


Leu 


Pro 


Tyr 


Trp 


Leu 








20 










25 










30 






Ala 


Thr 


Leu 


Val 


Ser 


Phe 


Arg 


Glu 


Thr 


Phe 


Gin 


Glu 


Glu 


Lys 


Leu 


Leu 






35 










40 










45 








Thr 


Met 


Lys 


Gly 


Ser 


Tyr 


Lys 


Ser 


Arg 


Trp 


Val 


He 


Val 


He 


Val 


Val 




50 










55 










60 










Val 


He 


Ala 


Ala 


He 


Ala 


Ala 


Phe 


Trp 


Phe 


Trp 


Gin 


Gly Arg Asn Asp 


65 










70 










75 










80 


Ser 


Arg 


Ser 


Ala 


Ala 


Pro 


Gly Ala 


Thr 


Lys 


Gin 


Ala 


Gin 


Gin 


Ser 


Pro 










85 










90 










95 




Ala Gly Gly Arg 


Arg 


Gly Met 


Arg 


Ser 


Gly 


Pro 


Leu 


Ala 


Pro 


Val 


Gin 








100 










105 










110 






Ala 


Ala 


Thr 


Ala 


Val 


Glu 


Gin 


Ala 


Val 


Pro 


Arg 


Tyr 


Leu 


Thr 


Gly 


Leu 






115 










120 










125 








Gly 


Thr 


He 


Thr 


Ala 


Ala 


Asn 


Thr 


Val 


Thr 


Val 


Arg 


Ser 


Arg 


Val 


Asp 




130 










135 










140 










Gly 


Gin 


Leu 


He 


Ala 


Leu 


His 


Phe 


Gin 


Glu 


Gly 


Gin 


Gin 


Val 


Lys 


Ala 


145 










150 










155 










160 


Gly Asp 


Leu 


Leu 


Ala 


Glu 


He 


Asp 


Pro 


Ser 


Gin 


Phe 


Lys 


Val 


Ala 


Leu 










165 










170 










175 




Ala 


Gin 


Ala 


Gin 


Gly 


Gin 


Leu 


Ala 


Lys 


Asp 


Lys 


Ala 


Thr 


Leu 


Ala 


Asn 








180 










185 










190 






Ala 


Arg 


Arg 


Asp 


Leu 


Ala 


Arg 


Tyr 


Gin 


Gin 


Leu 


Ala 


Lys 


Thr 


Asn 


Leu 






195 










200 










205 








Val 


Ser 


Arg 


Gin 


Glu 


Leu 


Asp 


Ala 


Gin 


Gin 


Ala 


Leu 


Val 


Ser 


Glu 


Thr 




210 










215 










220 










Glu 


Gly 


Thr 


He 


Lys 


Ala 


Asp 


Glu 


Ala 


Ser 


Val 


Ala 


Ser 


Ala 


Gin 


Leu 


225 










230 










235 










240 


Gin 


Leu 


Asp 


Trp 


Ser 


Arg 


He 


Thr 


Ala 


Pro 


Val 


Asp 


Gly Arg 


Val 


Gly 
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Leu Lys Gin Val 
260 

Gly He Val Val 
275 

Leu Pro Glu Ser 
290 

Lys Pro Leu Val 
305 

Ser Glu Gly Thr 

Gly Thr He Lys 
340 

Phe Pro Asn Gin 
355 

Asn Ala Val Val 
370 

His Phe Val Trp 
385 

Val Thr Pro Gly 

He Ser Ala Gly 
420 

Glu Gly Ala Lys 
435 

Glu Glu Lys Ala 
450 



245 

Asp Val Gly Asn 

He Thr Gin Thr 
280 

Asp He Ala Thr 
295 

Val Glu Ala Trp 
310 

Leu Leu Ser Leu 
325 

Val Lys Ala Arg 

Phe Val Asn Ala 
360 

He Pro Thr Ala 
375 

Val Leu Asn Ser 
390 

He Gin Asp Ser 
405 

Asp Arg Val Val 

Val Glu Val Val 
440 

Thr Ser Arg Glu 
455 



250 

Gin He Ser Ser 
265 

His Pro He Asp 

Val Val Gin Ala 
300 

Asp Arg Thr Asn 
315 

Asp Asn Gin He 
330 

Phe Asn Asn Gin 
345 

Arg Met Leu Val 

Ala Leu Gin Met 
380 

Glu Asn Lys Val 
395 

Gin Lys Val Val 
410 

Thr Asp Gly He 
425 

Glu Ala Gin Ser 

Tyr Ala Lys Lys 
460 



255 

Gly Asp Thr Thr 
270 

Leu Val Phe Thr 
285 

Gin Lys Ala Gly 

Ser Lys Lys Leu 
320 

Asp Ala Thr Thr 
335 

Asp Asp Ala Leu 
350 

Asp Thr Glu Gin 
365 

Gly Asn Glu Gly 

Ser Lys His Leu 
400 

He Arg Ala Gly 
415 

Asp Arg Leu Thr 
430 

Ala Thr Thr Pro 
445 

Gly Ala Arg Ser 



<210> 282 
<211> 1040 
<212> PRT 
<213> E. Coll 



<400> 282 
Met Gin Val Leu 
1 

He Met Arg Pro 
20 

Ala Gly He He 
35 

Val Asp Tyr Pro 
50 

Pro Asp Val Met 
65 

Gly Gin Met Ser 

Ala Ser Val He 
100 

Ala Glu Gin Glu 
115 

Pro Ser Asp Leu 
130 

Asp Pro Pro He 
145 

Thr Gin Val Glu 



Pro Pro Ser Ser 
5 

Val Ala Thr Thr 

Gly Tyr Arg Ala 
40 

Thr He Gin Val 
55 

Thr Ser Ala Val 
70 

Gly Leu Lys Gin 
85 

Thr Leu Gin Phe 

Val Gin Ala Ala 
120 

Pro Asn Pro Pro 
135 

Met Thr Leu Ala 
150 

Asp Met Val Glu 
165 



Thr Gly Gly Pro 
10 

Leu Leu Met Val 
25 

Leu Pro Val Ser 

Val Thr Leu Tyr 
60 

Thr Ala Pro Leu 
75 

Met Ser Ser Gin 
90 

Gin Leu Thr Leu 
105 

He Asn Ala Ala 

Val Tyr Ser Lys 
140 

Val Thr Ser Thr 
155 

Thr Arg Val Ala 

170 



Ser Arg Leu Phe 
15 

Ala He Leu Leu 
30 

Ala Leu Pro Glu 
45 

Pro Gly Ala Ser 

Glu Arg Gin Phe 
80 

Ser Ser Gly Gly 
95 

Pro Leu Asp Val 
110 

Thr Asn Leu Leu 
125 

Val Asn Pro Ala 

Ala Met Pro Met 
160 

Gin Lys He Ser 
175 
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Gin lie Ser Gly Val Gly Leu Val Thr Leu Ser Gly Gly Gin Arg Pro 

180 185 190 

Ala Val Arg Val Lys Leu Asn Ala Gin Ala He Ala Ala Leu Gly Leu 

195 200 205 

Thr Ser Glu Thr Val Arg Thr Ala He Thr Gly Ala Asn Val Asn Ser 

210 215 220 

Ala Lys Gly Ser Leu Asp Gly Pro Ser Arg Ala Val Thr Leu Ser Ala 
225 230 235 240 

Asn Asp Gin Met Gin Ser Ala Glu Glu Tyr Arg Gin Leu He He Ala 

245 250 255 

Tyr Gin Asn Gly Ala Pro He Arg Leu Gly Asp Val Ala Thr Val Glu 

260 265 270 

Gin Gly Ala Glu Asn Ser Trp Leu Gly Ala Trp Ala Asn Lys Glu Gin 

275 280 285 

Ala He Val Met Asn Val Gin Arg Gin Pro Gly Ala Asn He He Ser 

290 295 300 

Thr Ala Asp Ser He Arg Gin Met Leu Pro Gin Leu Thr Glu Ser Leu 
305 310 315 320 

Pro Lys Ser Val Lys Val Thr Val Leu Ser Asp Arg Thr Thr Asn He 

325 330 335 

Arg Ala Ser Val Asp Asp Thr Gin Phe Glu Leu Met Met Ala He Ala 

340 345 350 

Leu Val Val Met He He Tyr Leu Phe Leu Arg Asn He Pro Ala Thr 

355 360 365 

He He Pro Gly Val Ala Val Pro Leu Ser Leu He Gly Thr Phe Ala 

370 375 380 

Val Met Val Phe Leu Asp Phe Ser He Asn Asn Leu Thr Leu Met Ala 
385 390 395 400 

Leu Thr He Ala Thr Gly Phe Val Val Asp Asp Ala He Val Val He 

405 410 415 

Glu Asn He Ser Arg Tyr He Glu Lys Gly Glu Lys Pro Leu Ala Ala 

420 425 430 

Ala Leu Lys Gly Ala Gly Glu He Gly Phe Thr He He Ser Leu Thr 

435 440 445 

Phe Ser Leu He Ala Val Leu He Pro Leu Leu Phe Met Gly Asp He 

450 455 460 

Val Gly Arg Leu Phe Arg Glu Phe Ala He Thr Leu Ala Val Ala He 
465 470 475 480 

Leu He Ser Ala Val Val Ser Leu Thr Leu Thr Pro Met Met Cys Ala 

485 490 495 

Arg Met Leu Ser Gin Glu Ser Leu Arg Lys Gin Asn Arg Phe Ser Arg 

500 505 510 

Ala Ser Glu Lys Met Phe Asp Arg He He Ala Ala Tyr Gly Arg Gly 

515 520 525 

Leu Ala Lys Val Leu Asn His Pro Trp Leu Thr Leu Ser Val Ala Leu 

530 535 540 

Ser Thr Leu Leu Leu Ser Val Leu Leu Trp Val Phe He Pro Lys Gly 
545 550 555 560 

Phe Phe Pro Val Gin Asp Asn Gly He He Gin Gly Thr Leu Gin Ala 

565 570 575 

Pro Gin Ser Ser Ser Phe Ala Asn Met Ala Gin Arg Gin Arg Gin Val 

580 585 590 

Ala Asp Val He Leu Gin Asp Pro Ala Val Gin Ser Leu Thr Ser Phe 

595 600 605 

Val Gly Val Asp Gly Thr Asn Pro Ser Leu Asn Ser Ala Arg Leu Gin 

610 615 620 

He Asn Leu Lys Pro Leu Asp Glu Arg Asp Asp Arg Val Gin Lys Val 
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625 630 635 640 

He Ala Arg Leu Gin Thr Ala Val Asp Lys Val Pro Gly Val Asp Leu 

645 650 655 

Phe Leu Gin Pro Thr Gin Asp Leu Thr He Asp Thr Gin Val Ser Arg 

660 665 670 

Thr Gin Tyr Gin Phe Thr Leu Gin Ala Thr Ser Leu Asp Ala Leu Ser 

675 680 685 

Thr Trp Val Pro Gin Leu Met Glu Lys Leu Gin Gin Leu Pro Gin Leu 

690 695 700 

Ser Asp Val Ser Ser Asp Trp Gin Asp Lys Gly Leu Val Ala Tyr Val 
705 710 715 720 

Asn Val Asp Arg Asp Ser Ala Ser Arg Leu Gly He Ser Met Ala Asp 

725 730 735 

Val Asp Asn Ala Leu Tyr Asn Ala Phe Gly Gin Arg Leu He Ser Thr 

740 745 750 

He Tyr Thr Gin Ala Asn Gin Tyr Arg Val Val Leu Glu His Asn Thr 

755 760 765 

Glu Asn Thr Pro Gly Leu Ala Ala Leu Asp Thr He Arg Leu Thr Ser 

770 775 780 

Ser Asp Gly Gly Val Val Pro Leu Ser Ser He Ala Lys He Glu Gin 
785 790 795 800 

Arg Phe Ala Pro Leu Ser He Asn His Leu Asp Gin Phe Pro Val Thr 

805 810 815 

Thr He Ser Phe Asn Val Pro Asp Asn Tyr Ser Leu Gly Asp Ala Val 

820 825 830 

Gin Ala He Met Asp Thr Glu Lys Thr Leu Asn Leu Pro Val Asp He 

835 840 845 

Thr Thr Gin Phe Gin Gly Ser Thr Leu Ala Phe Gin Ser Ala Leu Gly 

850 855 860 

Ser Thr Val Trp Leu He Val Ala Ala Val Val Ala Met Tyr He Val 
865 870 875 880 

Leu Gly He Leu Tyr Glu Ser Phe He His Pro He Thr He Leu Ser 

885 890 895 

Thr Leu Pro Thr Ala Gly Val Gly Ala Leu Leu Ala Leu Leu He Ala 

900 905 910 

Gly Ser Glu Leu Asp Val He Ala He He Gly He He Leu Leu He 

915 920 925 

Gly He Val Lys Lys Asn Ala He Met Met He Asp Phe Ala Leu Ala 

930 935 940 

Ala Glu Arg Glu Gin Gly Met Ser Pro Arg Glu Ala He Tyr Gin Ala 
945 950 955 960 

Cys Leu Leu Arg Phe Arg Pro He Leu Met Thr Thr Leu Ala Ala Leu 

965 970 975 

Leu Gly Ala Leu Pro Leu Met Leu Ser Thr Gly Val Gly Ala Glu Leu 

980 985 990 

Arg Arg Pro Leu Gly He Gly Met Val Gly Gly Leu He Val Ser Gin 

995 1000 1005 

Val Leu Thr Leu Phe Thr Thr Pro Val He Tyr Leu Leu Phe Asp Arg 

1010 1015 1020 

Leu Ala Leu Trp Thr Lys Ser Arg Phe Ala Arg His Glu Glu Glu Ala 
1025 1030 1035 1040 



<210> 283 
<211> 1025 
<212> PRT 
<213> E. Coll 
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<400> 283 

Met Lys Phe Phe Ala Leu Phe He Tyr Arg Pro Val Ala Thr He Leu 

15 10 15 

Leu Ser Val Ala He Thr Leu Cys Gly He Leu Gly Phe Arg Met Leu 

20 25 30 

Pro Val Ala Pro Leu Pro Gin Val Asp Phe Pro Val He He Val Ser 

35 40 45 

Ala Ser Leu Pro Gly Ala Ser Pro Glu Thr Met Ala Ser Ser Val Ala 

50 55 60 

Thr Pro Leu Glu Arg Ser Leu Gly Arg He Ala Gly Val Ser Glu Met 
65 70 75 80 

Thr Ser Ser Ser Ser Leu Gly Ser Thr Arg He He Leu Gin Phe Asp 

85 90 95 

Phe Asp Arg Asp He Asn Gly Ala Ala Arg Asp Val Gin Ala Ala He 

100 105 110 

Asn Ala Ala Gin Ser Leu Leu Pro Ser Gly Met Pro Ser Arg Pro Thr 

115 120 125 

Tyr Arg Lys Ala Asn Pro Ser Asp Ala Pro He Met He Leu Thr Leu 

130 135 140 

Thr Ser Asp Thr Tyr Ser Gin Gly Glu Leu Tyr Asp Phe Ala Ser Thr 
145 150 155 160 

Gin Leu Ala Pro Thr He Ser Gin He Asp Gly Val Gly Asp Val Asp 

165 170 175 

Val Gly Gly Ser Ser Leu Pro Ala Val Arg Val Gly Leu Asn Pro Gin 

180 185 190 

Ala Leu Phe Asn Gin Gly Val Ser Leu Asp Asp Val Arg Thr Ala Val 

195 200 205 

Ser Asn Ala Asn Val Arg Lys Pro Gin Gly Ala Leu Glu Asp Gly Thr 

210 215 220 

His Arg Trp Gin He Gin Thr Asn Asp Glu Leu Lys Thr Ala Ala Glu 
225 230 235 240 

Tyr Gin Pro Leu He He His Tyr Asn Asn Gly Gly Ala Val Arg Leu 

245 250 255 

Gly Asp Val Ala Thr Val Thr Asp Ser Val Gin Asp Val Arg Asn Ala 

260 265 270 

Gly Met Thr Asn Ala Lys Pro Ala He Leu Leu Met He Arg Lys Leu 

275 280 285 

Pro Glu Ala Asn He He Gin Thr Val Asp Ser He Arg Ala Lys Leu 

290 295 300 

Pro Glu Leu Gin Glu Thr He Pro Ala Ala He Asp Leu Gin He Ala 
305 310 315 320 

Gin Asp Arg Ser Pro Thr He Arg Ala Ser Leu Glu Glu Val Glu Gin 

325 330 335 

Thr Leu He He Ser Val Ala Leu Val He Leu Val Val Phe Leu Phe 

340 345 350 

Leu Arg Ser Gly Arg Ala Thr He He Pro Ala Val Ser Val Pro Val 

355 360 365 

Ser Leu He Gly Thr Phe Ala Ala Met Tyr Leu Cys Gly Phe Ser Leu 

370 375 380 

Asn Asn Leu Ser Leu Met Ala Leu Thr He Ala Thr Gly Phe Val Val 
385 390 395 400 

Asp Asp Ala He Val Val Leu Glu Asn He Ala Arg His Leu Glu Ala 

405 410 415 

Gly Met Lys Pro Leu Gin Ala Ala Leu Gin Gly Thr Arg Glu Val Gly 

420 425 430 

Phe Thr Val Leu Ser Met Ser Leu Ser Leu Val Ala Val Phe Leu Pro 
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435 440 445 

Leu Leu Leu Met Gly Gly Leu Pro Gly Arg Leu Leu Arg Glu Phe Ala 

450 455 460 

Val Thr Leu Ser Val Ala He Gly He Ser Leu Leu Val Ser Leu Thr 
465 470 475 480 

Leu Thr Pro Met Met Cys Gly Trp Met Leu Lys Ala Ser Lys Pro Arg 

485 490 495 

Glu Gin Lys Arg Leu Arg Gly Phe Gly Arg Met Leu Val Ala Leu Gin 

500 505 510 

Gin Gly Tyr Gly Lys Ser Leu Lys Trp Val Leu Asn His Thr Arg Leu 

515 520 525 

Val Gly Val Val Leu Leu Gly Thr He Ala Leu Asn He Trp Leu Tyr 

530 535 540 

He Ser He Pro Lys Thr Phe Phe Pro Glu Gin Asp Thr Gly Val Leu 
545 550 555 560 

Met Gly Gly He Gin Ala Asp Gin Ser He Ser Phe Gin Ala Met Arg 

565 570 575 

Gly Lys Leu Gin Asp Phe Met Lys He He Arg Asp Asp Pro Ala Val 

580 585 590 

Asp Asn Val Thr Gly Phe Thr Gly Gly Ser Arg Val Asn Ser Gly Met 

595 600 605 

Met Phe He Thr Leu Lys Pro Arg Asp Glu Arg Ser Glu Thr Ala Gin 

610 615 620 

Gin He He Asp Arg Leu Arg Val Lys Leu Ala Lys Glu Pro Gly Ala 
625 630 635 640 

Asn Leu Phe Leu Met Ala Val Gin Asp He Arg Val Gly Gly Arg Gin 

645 650 655 

Ser Asn Ala Ser Tyr Gin Tyr Thr Leu Leu Ser Asp Asp Leu Ala Ala 

660 665 670 

Leu Arg Glu Trp Glu Pro Lys He Arg Lys Lys Leu Ala Thr Leu Pro 

675 680 685 

Glu Leu Ala Asp Val Asn Ser Asp Gin Gin Asp Asn Gly Ala Glu Met 

690 695 700 

Asn Leu Val Tyr Asp Arg Asp Thr Met Ala Arg Leu Gly He Asp Val 
705 710 715 720 

Gin Ala Ala Asn Ser Leu Leu Asn Asn Ala Phe Gly Gin Arg Gin He 

725 730 735 

Ser Thr He Tyr Gin Pro Met Asn Gin Tyr Lys Val Val Met Glu Val 

740 745 750 

Asp Pro Arg Tyr Thr Gin Asp He Ser Ala Leu Glu Lys Met Phe Val 

755 760 765 

He Asn Asn Glu Gly Lys Ala He Pro Leu Ser Tyr Phe Ala Lys Trp 

770 775 780 

Gin Pro Ala Asn Ala Pro Leu Ser Val Asn His Gin Gly Leu Ser Ala 
785 790 795 800 

Ala Ser Thr He Ser Phe Asn Leu Pro Thr Gly Lys Ser Leu Ser Asp 

805 810 815 

Ala Ser Ala Ala He Asp Arg Ala Met Thr Gin Leu Gly Val Pro Ser 

820 825 830 

Thr Val Arg Gly Ser Phe Ala Gly Thr Ala Gin Val Phe Gin Glu Thr 

835 840 845 

Met Asn Ser Gin Val He Leu He He Ala Ala He Ala Thr Val Tyr 

850 855 860 

He Val Leu Gly He Leu Tyr Glu Ser Tyr Val His Pro Leu Thr He 
865 870 875 880 

Leu Ser Thr Leu Pro Ser Ala Gly Val Gly Ala Leu Leu Ala Leu Glu 
885 890 895 
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Leu 


Phe 


Asn 


Ala 
900 


Pro 


Phe 


Ser 


Leu 


He 
905 


Ala 


Leu 


He 


Gly 


He 
910 


Met 


Leu 


Leu 


He 


Gly 
915 


He 


Val 


Lys 


Lys 


Asn 
920 


Ala 


He 


Met 


Met 


Val 
925 


Asp 


Phe 


Ala 


Leu 


Glu 
930 


Ala 


Gin 


Arg 


His 


Gly 
935 


Asn 


Leu 


Thr 


Pro 


Gin 
940 


Glu 


Ala 


He 


Phe 


n n 


rl-Ld 






Leu 


Arg 


Phe 


Arg 


Pro 


He 


Met 


Met 


Thr 


Thr 


Leu 


Ala 


^ -J 










950 










955 










960 


Ala 


Leu 


Phe 


Gly 


Ala 


Leu 


Pro 


Leu 


Val 


Leu 


Ser 


Gly 


Gly Asp 


Gly 


Ser 










965 










970 










975 




Glu 


Leu 


Arg 


Gin 
980 


Pro 


Leu 


Gly 


He 


Thr 
985 


He 


Val 


Gly 


Gly 


Leu 
990 


Val 


Met 


Ser 


Gin 


Leu 


Leu 


Thr 


Leu 


Tyr 


Thr 


Thr 


Pro 


Val 


Val 


Tyr 


Leu 


Phe 


Phe 






995 










1000 








1005 






Asp 


Arg 


Leu 


Arg 


Leu 


Arg 


Phe 


Ser 


Arg 


Lys 


Pro 


Lys 


Gin 


Thr 


Val 


Thr 



1010 1015 1020 



Glu 
1025 



<210> 284 
<211> 471 
<212> PRT 
<213> E. Coli 

<400> 284 

Met Thr Asp Leu Pro Asp Ser Thr Arg Trp Gin Leu Trp He Val Ala 

15 10 15 

Phe Gly Phe Phe Met Gin Ser Leu Asp Thr Thr He Val Asn Thr Ala 

20 25 30 

Leu Pro Ser Met Ala Gin Ser Leu Gly Glu Ser Pro Leu His Met His 

35 40 45 

Met Val He Val Ser Tyr Val Leu Thr Val Ala Val Met Leu Pro Ala 

50 55 60 

Ser Gly Trp Leu Ala Asp Lys Val Gly Val Arg Asn He Phe Phe Thr 
65 70 75 80 

Ala He Val Leu Phe Thr Leu Gly Ser Leu Phe Cys Ala Leu Ser Gly 

85 90 95 

Thr Leu Asn Glu Leu Leu Leu Ala Arg Ala Leu Gin Gly Val Gly Gly 

100 105 110 

Ala Met Met Val Pro Val Gly Arg Leu Thr Val Met Lys He Val Pro 

115 120 125 

Arg Glu Gin Tyr Met Ala Ala Met Thr Phe Val Thr Leu Pro Gly Gin 

130 135 140 

Val Gly Pro Leu Leu Gly Pro Ala Leu Gly Gly Leu Leu Val Glu Tyr 
145 150 155 160 

Ala Ser Trp His Trp He Phe Leu He Asn He Pro Val Gly He He 

165 170 175 

Gly Ala He Ala Thr Leu Leu Leu Met Pro Asn Tyr Thr Met Gin Thr 

180 185 190 

Arg Arg Phe Asp Leu Ser Gly Phe Leu Leu Leu Ala Val Gly Met Ala 

195 200 205 

Val Leu Thr Leu Ala Leu Asp Gly Ser Lys Gly Thr Gly Leu Ser Pro 

210 215 220 

Leu Thr He Ala Gly Leu Val Ala Val Gly Val Val Ala Leu Val Leu 
225 230 235 240 

Tyr Leu Leu His Ala Arg Asn Asn Asn Arg Ala Leu Phe Ser Leu Lys 
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Leu Phe Arg Thr 
260 

Gly Arg He Gly 
275 

Gin He Gly Leu 
290 

Pro Met Val Leu 
305 

Val Asn Arg Phe 

Leu Ser Leu Val 
340 

Tyr Tyr Val Leu 
355 

Thr Arg Phe Ser 
370 

Asn Leu Ala Ser 
385 

Ser Met Ser He 

Gly Ser Gin His 
420 

Met Tyr Thr Trp 
435 

He Phe Ala Arg 
450 

Arg Arg Lys Arg 
465 



245 

Arg Thr Phe Ser 

Ser Gly Met Leu 
280 

Gly Phe Ser Pro 
295 

Gly Ser Met Gly 
310 

Gly Tyr Arg Arg 
325 

Thr Leu Leu Phe 

Pro Phe Val Leu 
360 

Ser Met Asn Thr 
375 

Ser Gly Asn Ser 
390 

Gly Val Thr He 
405 

Val Ser Val Asp 

Leu Ser Met Ala 
440 

Val Pro Asn Asp 
455 

Ser Ala Gin 
470 



250 

Leu Gly Leu Ala 
265 

Pro Phe Met Thr 

Phe His Ala Gly 
300 

Met Lys Arg He 
315 

Val Leu Val Ala 
330 

Met Thr Thr Ala 
345 

Phe Leu Gin Gly 

Leu Thr Leu Lys 
380 

Leu Leu Ser Met 
395 

Ala Gly Leu Leu 
410 

Ser Gly Thr Thr 
425 

Leu He He Ala 

Thr His Gin Asn 
460 



255 

Gly Ser Phe Ala 
270 

Pro Val Phe Leu 
285 

Leu Met Met He 

Val Val Gin Val 
320 

Thr Thr Leu Gly 
335 

Leu Leu Gly Trp 
350 

Met Val Asn Ser 
365 

Asp Leu Pro Asp 

He Met Gin Leu 
400 

Leu Gly Leu Phe 
415 

Gin Thr Val Phe 
430 

Leu Pro Ala Phe 
445 

Val Ala He Ser 



<210> 285 
<211> 344 
<212> PRT 
<213> Coli 

<400> 285 

Met Glu He Arg He Met Leu Phe He Leu Met Met Met Val Met Pro 

15 10 15 

Val Ser Tyr Ala Ala Cys Tyr Ser Glu Leu Ser Val Gin His Asn Leu 

20 25 30 

Val Val Gin Gly Asp Phe Ala Leu Thr Gin Thr Gin Met Ala Thr Tyr 

35 40 45 

Glu His Asn Phe Asn Asp Ser Ser Cys Val Ser Thr Asn Thr He Thr 

50 55 60 

Pro Met Ser Pro Ser Asp He He Val Gly Leu Tyr Asn Asp Thr He 
65 70 75 80 

Lys Leu Asn Leu His Phe Glu Trp Thr Asn Lys Asn Asn He Thr Leu 

85 90 95 

Ser Asn Asn Gin Thr Ser Phe Thr Ser Gly Tyr Ser Val Thr Val Thr 

100 105 110 

Pro Ala Ala Ser Asn Ala Lys Val Asn Val Ser Ala Gly Gly Gly Gly 

115 120 125 

Ser Val Met He Asn Gly Val Ala Thr Leu Ser Ser Ala Ser Ser Ser 

130 135 140 

Thr Arg Gly Ser Ala Ala Val Gin Phe Leu Leu Cys Leu Leu Gly Gly 
145 150 155 160 
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Lys Ser Trp Asp Ala Cys Val Asn Ser Tyr Arg Asn Ala Leu Ala Gin 

165 170 175 

Asn Ala Gly Val Tyr Ser Phe Asn Leu Thr Leu Ser Tyr Asn Pro lie 

180 185 190 

Thr Thr Thr Cys Lys Pro Asp Asp Leu Leu lie Thr Leu Asp Ser lie 

195 200 205 

Pro Val Ser Gin Leu Pro Ala Thr Gly Asn Lys Ala Thr lie Asn Ser 

210 215 220 

Lys Gin Gly Asp lie lie Leu Arg Cys Lys Asn Leu Leu Gly Gin Gin 
225 230 235 240 

Asn Gin Thr Ser Arg Lys Met Gin Val Tyr Leu Ser Ser Ser Asp Leu 

245 250 255 

Leu Thr Asn Ser Asn Thr lie Leu Lys Gly Ala Glu Asp Asn Gly Val 

260 265 270 

Gly Phe lie Leu Glu Ser Asn Gly Ser Pro Val Thr Leu Leu Asn lie 

275 280 285 

Thr Asn Ser Ser Lys Gly Tyr Thr Asn Leu Lys Glu Val Ala Ala Lys 

290 295 300 

Ser Lys Leu Thr Asp Thr Thr Val Ser lie Pro lie Thr Ala Ser Tyr 
305 310 315 320 

Tyr Val Tyr Asp Thr Asn Lys Val Lys Ser Gly Ala Leu Glu Ala Thr 

325 330 335 

Ala Leu lie Asn Val Lys Tyr Asp 
340 



<210> 286 

<211> 826 

<212> PRT 

<213> E. Coli 

<400> 286 

Met Leu Arg Met Thr Pro Leu Ala Ser Ala lie Val Ala Leu Leu Leu 

15 10 15 

Gly lie Glu Ala Tyr Ala Ala Glu Glu Thr Phe Asp Thr His Phe Met 

20 25 30 

lie Gly Gly Met Lys Asp Gin Gin Val Ala Asn lie Arg Leu Asp Asp 

35 40 45 

Asn Gin Pro Leu Pro Gly Gin Tyr Asp lie Asp lie Tyr Val Asn Lys 

50 55 60 

Gin Trp Arg Gly Lys Tyr Glu lie lie Val Lys Asp Asn Pro Gin Glu 
65 70 75 80 

Thr Cys Leu Ser Arg Glu Val lie Lys Arg Leu Gly lie Asn Ser Asp 

85 90 95 

Asn Phe Ala Ser Gly Lys Gin Cys Leu Thr Phe Glu Gin Leu Val Gin 

100 105 110 

Gly Gly Ser Tyr Thr Trp Asp lie Gly Val Phe Arg Leu Asp Phe Ser 

115 120 125 

Val Pro Gin Ala Trp Val Glu Glu Leu Glu Ser Gly Tyr Val Pro Pro 

130 135 140 

Glu Asn Trp Glu Arg Gly lie Asn Ala Phe Tyr Thr Ser Tyr Tyr Leu 
145 150 155 160 

Ser Gin Tyr Tyr Ser Asp Tyr Lys Ala Ser Gly Asn Asn Lys Ser Thr 

165 170 175 

Tyr Val Arg Phe Asn Ser Gly Leu Asn Leu Leu Gly Trp Gin Leu His 

180 185 190 

Ser Asp Ala Ser Phe Ser Lys Thr Asn Asn Asn Pro Gly Val Trp Lys 
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195 

Ser Asn Thr Leu 
210 

Leu Arg Val Gly 
225 

Arg Phe Arg Gly 

Ser Lys Gin Asn 
260 

Ala Leu Val Thr 
275 

Val Pro Pro Gly 
290 

Gly Ala Asp Leu 
305 

Thr Tyr Leu Val 

Val Ser Lys Tyr 
340 

Ser Lys Gin Ser 
355 

Asn Leu Leu Thr 
370 

Ala Phe Thr Leu 
385 

Val Asp Ala Thr 

Asp Gly Gin Ser 
420 

Ser Thr Arg Phe 
435 

Arg Thr Phe Asn 
450 

Arg Asp Glu Asn 
465 

Phe Gly Arg Lys 

Glu Gly Trp Gly 
500 

Gly Arg Ser Gly 
515 

Leu Arg Arg lie 
530 

Asn His His Glu 
545 

Asp Trp Gly Asp 

Asn Ser Thr Thr 
580 

Leu Ser Gly Thr 
595 

Leu Ser His Gin 
610 

Thr Trp Asn Ala 
625 

Ser Thr Tyr Arg 



200 

Tyr Leu Glu Arg 
215 

Asp Met Tyr Thr 
230 

Val Arg Leu Phe 
245 

Phe Thr Pro Arg 

He Glu Gin Asn 
280 

Pro Phe Ala He 
295 

Asp Val Ser Val 
310 

Pro Tyr Ala Ala 
325 

Asp Leu Ala Ala 

Asp Phe Val Gin 
360 

Leu Tyr Gly Gly 
375 

Gly Ala Gly Trp 
390 

Lys Ser His Ser 
405 

Tyr Gin He Ala 

Gly Leu Ala Ala 
440 

Asp His Val Trp 
455 

Asp Val Tyr Asp 
470 

Asn Ser Phe Ser 
485 

Ser Val Ser Leu 

Ser Ser Lys Asp 
520 

Ser Tyr Thr Leu 
535 

Glu Lys Arg Phe 
550 

Asp Val Ser Thr 
565 

Phe Asp Asp Gin 

Val Gly Ser Arg 
600 

His Gin Gly Asn 
615 

Pro Val Ala Thr 
630 

Gin Ala Gly Ala 
645 



Gly Phe Ala Gin 
220 

Ser Ser Asp He 
235 

Arg Asp Met Gin 
250 

Val Gin Gly He 
265 

Gly Phe Val Val 

Thr Asp Leu Gin 
300 

Lys Glu Ala Asp 
315 

Val Pro Asn Met 
330 

Gly Arg Ser His 
345 

Ala Gly Tyr Gin 

Ser Met Val Ala 
380 

Asn Thr Arg He 
395 

Lys Gin Asp Asn 
410 

Tyr Asn Lys Phe 
425 

Trp Arg Tyr Ser 

Ala Asn Asn Lys 
460 

He Ala Asp Tyr 
475 

Ala Asn Met Ser 
490 

Ser Thr Leu Trp 

505 

Tyr Gin Leu Ser 

Ala Ala Ser Gin 
540 

Asn He Phe He 
555 

Pro Arg Arg Gin 
570 

Gly Phe Ala Ser 
585 

Asp Gin Phe Asn 

Glu Thr Thr Ala 
620 

Val Asn Gly Ser 
635 

Ser Val Ser Gly 
650 



205 

Leu Leu Gly Thr 

Phe Asp Ser Val 
240 

Met Leu Pro Asn 
255 

Ala Gin Ser Asn 
270 

Tyr Gin Lys Glu 
285 

Leu Ala Gly Gly 

Gly Ser Val Thr 
320 

Leu Gin Pro Gly 
335 

He Glu Gly Ala 
350 

Tyr Gly Phe Asn 
365 

Asn Asn Tyr Tyr 

Gly Ala He Ser 
400 

Gly Asp Val Phe 
415 

Val Ser Gin Thr 
430 

Ser Arg Asp Tyr 
445 

Asp Asn Tyr Arg 

Tyr Gin Asn Asp 
480 

Gin Ser Leu Pro 
495 

Arg Asp Tyr Trp 
510 

Tyr Ser Asn Asn 
525 

Ala Tyr Asp Glu 

Ser He Pro Phe 
560 

He Tyr Met Ser 
575 

Asn Asn Thr Gly 
590 

Tyr Gly Val Asn 
605 

Gly Ala Asn Leu 

Tyr Ser Gin Ser 
640 

Gly He Val Ala 
655 
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TrD 


Ser 


Gly Gly 


Val 


Asn 


Leu 


Ala 


Asn 


Arg 


Leu 


Ser 


Glu 


Thr Phe 


Ala 








660 










665 










670 




Val 


Met 


675 


Ala 


Pro 


Gly 


He 


Lys 
680 


Asp 


Ala 


Tyr 


Val 


Asn 
685 


Gly Gin 


Lys 


Tyr 


Arg 


Thr 


Thr 


Asn 


Arg 


Asn 


Gly Val 


Val 


He 


Tyr 


Asp 


Gly Met 


Thr 




690 










695 










700 








Pro 


Tyr 


rij- y 




Asn 


His 


Leu 




Leu Asp Val 


Ser 


Gin 


Ser Asp 


Ser 


705 










710 










715 








720 


Glu 


Ala 


^ J_ u. 


xjeu 


Arg 


Gly Asn 


r\i^ y 


Lys 


He 


Ala 


Ala 


Pro 


Tyr Arg Gly 










725 










730 








735 




Ala 


Val 


Val 


Leu 
740 


Val 


Asn 


Phe 


ASp 


Thr 
745 


Asp 


Gin 


Arg 


Lys 


Pro Trp 
750 


Phe 


He 


Lys 


Ala 
755 


Leu 


Arg 


Ala 


Asp 


Gly 
760 


Gin 


Ser 


Leu 


Thr 


Phe 
765 


Gly Tyr 


Glu 


Val 


Asn 


i^Sp 




His 


Gly 


His 


Asn 


He 


Gly Val 


Val 


Gly Gin Gly 


Ser 




7 7 0 










775 










780 








Gin 


Leu 


Phe 


He 


Arg 


Thr 


Asn 


Glu 


Val 


Pro 


Pro 


Ser 


Val 


Asn Val 


Ala 


785 










790 










795 








800 


He 


Asp 


Lys 


Gin 


Gin 
805 


Gly 


Leu 


Ser 


Cys 


Thr 
810 


He 


Thr 


Phe 


Gly Lys 
815 


Glu 


He 


Asp 


Glu 


Ser 
820 


Arg 


Asn 


Tyr 


He 


Cys 
825 


Gin 













<210> 287 
<211> 239 
<212> PRT 
<213> E. Coll 



<400> 287 
Met Ala Ala He 
1 

Lys Gly Leu Leu 
20 

Ala Gly He Val 
35 

Lys Glu Val Met 
50 

Leu Gin Ala Trp 
65 

He Gin Val Pro 

Asn Ser Gly Gin 
100 

Thr Asn Lys Glu 
115 

Asn Ser Pro Glu 
130 

Asn Arg He Lys 
145 

Lys Ala Thr Phe 

Val He Lys Asn 
180 

Ala Asn Asn Val 
195 

Glu Ser Gin Ser 



Pro Trp Arg Pro 
5 

Ser Leu Leu He 

He Tyr Gly Thr 
40 

Val Gin Leu Met 
55 

He Asp Asp Gly 
70 

Phe Met Leu Thr 
85 

Gin Val Lys He 

Ser He Phe Tyr 
120 

Gin Glu Gly Lys 
135 

Leu Phe Tyr Arg 
150 

Lys Lys Leu Leu 
165 

Asp Ser Ala Asn 

Lys Val Asn Tyr 
200 

Val Asn Val Lys 



Phe Asn Leu Arg 
10 

Phe Ser Met Val 
25 

Arg He He Tyr 

Asn Gin Gly Asn 
60 

Asp Thr Ser Leu 
75 

Pro Pro Val Ala 
90 

Lys He Met Pro 
105 

Leu Asn Val Leu 

Asn Ala Leu Lys 
140 

Pro Ala Gly He 
155 

Val Asn Arg Ser 
170 

Trp Val Thr He 
185 

Glu Thr He Met 
Ser Asn Asn Ala 



Gly He Lys Met 
15 

Leu Pro Ala His 
30 

Pro Ala Glu Asn 
45 

Arg Ser Ser Leu 

Pro Pro Glu Lys 
80 

Lys He Gly Ala 
95 

Asn Lys Leu Pro 
110 

Asp He Pro Pro 
125 

Phe Ala Met Gin 

Ala Pro Val Asn 
160 

Gly Asn Gly Leu 
175 

Ser Asp Val Lys 
190 

He Ala Pro Leu 
205 

Asn Asn Trp His 
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210 215 220 

Leu Thr He He Asp Asp His Gly Asn Tyr He Ser Asp Lys He 
225 230 235 



<210> 288 
<211> 180 
<212> PRT 
<213> E. Coll 



<400> 288 
Met Lys Arg Ser 
1 

Ser Ala Gly Val 
20 

Lys Gly Asn He 
35 

Ser Val Ser He 
50 

Lys Ala Lys Tyr 
65 

Met Leu Lys He 

Leu He Thr Asn 
100 

Ser Asn Asp Thr 
115 

Val Leu Asp Val 
130 

Glu Gly Gin Tyr 
145 

Asp Asn Ser Val 

Val Ala Gin Asp 
180 



He He Ala Ala 
5 

Phe Ala Ala Asp 

Ala Glu Ser Pro 
40 

Asn Met Pro Thr 
55 

Ser Thr Tyr Asp 
70 

Ser Cys Pro Lys 
85 

Asp Lys He Thr 

Val Gly Tyr Tyr 
120 

Ser Ala Pro Phe 
135 

Ala He Pro Phe 
150 

Gin Ser Gly Asp 
165 



Ala Val Phe Ser 

10 

Val Asp Thr Gly 
25 

Cys Lys Phe Glu 

Val Pro Thr Ser 
60 

Asp Ala Val Gly 
75 

Glu Val Ala Gly 
90 

Gly Asn Asp Lys 
105 

Leu Tyr Leu Gly 

Asn He Glu Ser 
140 

Lys Ala Lys Tyr 
155 

Val Leu Ser Ser 
170 



Ser Phe Phe Met 
15 

Thr Leu Thr He 
30 

Ala Gly Gly Asp 
45 

Val Phe Glu Gly 

Val Thr Ser Ser 
80 

Val Lys Leu Ser 
95 

Ala He Ala Ser 
110 

Asp Asn Ser Asp 
125 

Tyr Lys Thr Ala 

Leu Lys Leu Thr 
160 

Leu Val Met Arg 
175 



<210> 289 
<211> 112 
<212> PRT 
<213> E. Coli 





<400> 


289 


























Met 


Ser 


Ser 


Glu 


Arg 


Asp 


Leu 


Val 


Asn 


Phe 


Leu 


Gly Asp 


Phe 


Ser 


Met 


1 








5 










10 










15 




Asp 


Val 


Ala 


Lys 


Ala 


Val 


He 


Ala 


Gly Gly Val 


Ala 


Thr 


Ala 


He 


Gly 








20 










25 










30 






Ser 


Leu 


Ala 
35 


Ser 


Phe 


Ala 


Cys 


Val 
40 


Ser 


Phe 


Gly 


Phe 


Pro 
45 


Val 


He 


Leu 


Val 


Gly 


Gly Ala 


He 


Leu 


Leu 


Thr 


Gly 


He 


Val 


Cys 


Thr 


Val 


Val 


Leu 




50 










55 










60 










Asn 


Glu 


He 


Asp 


Ala 


Gin 


Cys 


His 


Leu 


Ser 


Glu 


Lys 


Leu 


Lys 


Tyr 


Ala 


65 










70 










75 










80 


He 


Arg 


Asp 


Gly 


Leu 
85 


Lys 


Arg 


Gin 


Gin 


Glu 
90 


Leu 


Asp 


Lys 


Trp 


Lys 
95 


Arg 


Glu 


Asn 


Met 


Thr 


Pro 


Phe 


Met 


Tyr 


Val 


Leu 


Asn 


Thr 


Pro 


Pro 


Val 


He 
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100 



105 



110 



<210> 
<211> 
<212> 
<213> 

<400> 
Met Thr Asp 
1 

Phe Val Leu 

Lys Lys Leu 
35 

Met Thr Leu 
50 

lie Pro Leu 
65 

He Ala Val 

Pro Val Leu 

Asn Cys Ala 
115 

Asn Phe Leu 

130 
Ser Leu Val 
145 

Ala Asp Val 
Ala Gly Leu 
Leu 



290 
193 
PRT 

E, Coll 
290 

Tyr Leu 
5 

Val Lys 
20 

Glu Thr 

Ala Ser 

Asn Leu 

Val Val 
85 

Tyr Arg 
100 

Val Leu 

Gin Ser 

Met Val 

Pro Ala 
165 
Met Ser 
180 



Leu Leu Phe Val 
Phe Leu 
Ala Met 



He Cys 

55 
He Tyr 
70 

Gin Phe 

Leu Leu 

Gly Val 

Ala Leu 
135 
Leu Phe 
150 

Pro Phe 
Leu Ala 



Gly Leu 

25 
Gly Met 
40 

Ala Trp 



Leu Arg 

Thr Glu 

Gly He 
105 
Ala Leu 
120 

Tyr Gly 

Ala Ala 

Arg Gly 

Phe Met 
185 



Gly Thr 
10 

Cys Pro 

Gly Leu 

Leu He 

Thr Leu 
75 

Met Val 
90 

Phe Leu 

Leu Asn 

Phe Ser 

He Arg 
155 
Asn Ala 
170 

Gly Phe 



Val Leu 

Phe Met 

Ala Thr 
45 

Asp Thr 
60 

Ala Phe 

Val Arg 

Pro Leu 

He Asn 
125 
Ala Ala 
140 

Glu Arg 
He Ala 
Ser Gly 



Val Asn Asn 
15 

Gly Val Ser 
30 

Thr Phe Val 

Trp He Leu 

He Leu Val 
80 

Lys Thr Ser 
95 

He Thr Thr 
110 

Leu Gly His 

Val Gly Phe 

Leu Ala Val 
160 

Leu He Thr 
175 

Leu Val Lys 
190 



<210> 291 
<211> 192 
<212> PRT 
<213> E. Coli 





<400> 


291 
























Met 


Asn 


Ala 


He 


Trp 


He 


Ala 


Val Ala 


Ala 


Val 


Ser 


Leu 


Leu Gly Leu 


1 








5 








10 










15 




Ala 


Phe 


Gly Ala 


He 


Leu 


Gly 


Tyr Ala 


Ser 


Arg 


Arg 


Phe 


Ala 


Val 


Glu 








20 








25 










30 






Asp 


Asp 


Pro 


Val 


Val 


Glu 


Lys 


He Asp 


Glu 


He 


Leu 


Pro 


Gin 


Ser 


Gin 






35 










40 








45 








Cys 


Gly 


Gin 


Cys 


Gly 


Tyr 


Pro 


Gly Cys 


Arg 


Pro 


Tyr 


Ala 


Glu 


Ala 


He 




50 










55 








60 










Ser 


Cys 


Asn 


Gly 


Glu 


Lys 


He 


Asn Arg 


Cys 


Ala 


Pro 


Gly Gly 


Glu 


Ala 


65 










70 








75 










80 


Val 


Met 


Leu 


Lys 


He 


Ala 


Glu 


Leu Leu 


Asn 


Val 


Glu 


Pro 


Gin 


Pro 


Leu 










85 








90 










95 




Asp 


Gly 


Glu 


Ala 


Gin 


Glu 


He 


Thr Pro 


Ala 


Arg 


Met 


Val 


Ala 


Val 


He 
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100 

Asp Glu Asn Asn Cys lie Gly Cys 
115 120 
Val Asp Ala lie Val Gly Ala Thr 

130 135 
Asp Leu Cys Thr Gly Cys Asn Leu 
145 150 
Cys lie Ser Leu Gin Pro Val Ala 
165 

Asp Leu Asn Thr lie Pro Val Arg 
180 



105 110 
Thr Lys Cys lie Gin Ala Cys Pro 
125 

Arg Ala Met His Thr Val Met Ser 
140 

Cys Val Asp Pro Cys Pro Thr His 
155 160 
Glu Thr Pro Asp Ser Trp Lys Trp 

170 175 
lie lie Pro Val Glu His His Ala 
185 190 



<210> 292 
<211> 740 
<212> PRT 
<213> E. Coli 





<400> 


292 


























Met 


Leu 


Lys 


Leu 


Phe 


Ser 


Ala 


Phe 


Arg 


Lys 


Asn 


Lys 


He 


Trp 


Asp 


Phe 


1 








5 










10 










15 




Asn 


Gly 


Gly 


He 
20 


His 


Pro 


Pro 


Glu 


Met 
25 


Lys 


Thr 


Gin 


Ser 


Asn 
30 


Gly 


Thr 


Pro 


Leu 


Arg 


Gin 


Val 


Pro 


Leu 


Ala 


Gin Arg 


Phe 


Val 


He 


Pro 


Leu 


Lys 






35 










40 










45 








Gin 


His 


He 


Gly 


Ala 


Glu 


Gly 


Glu 


Leu 


Cys 


Val 


Ser 


Val 


Gly Asp 


Lys 




50 










55 










60 










Val 


Leu 


Arg 


Gly 


Gin 


Pro 


Leu 


Thr 


Arg 


Gly Arg 


Gly 


Lys 


Met 


Leu 


Pro 


65 










70 










75 










80 


Val 


His 


Ala 


Pro 


Thr 

85 


Ser 


Gly 


Thr 


Val 


Thr 
90 


Ala 


He 


Ala 


Pro 


His 
95 


Ser 


Thr 


Ala 


His 


Pro 
100 


Ser 


Ala 


Leu 


Ala 


Glu 
105 


Leu 


Ser 


Val 


He 


He 
110 


Asp 


Ala 


Asp 


Gly 


Glu 


Asp 


Cys 


Trp 


He 


Pro 


Arg 


Asp 


Gly Trp 


Ala 


Asp 


Tyr Arg 






115 










120 










125 








Thr 


Arg 


Ser 


Arg 


Glu 


Glu 


Leu 


He 


Glu 


Arg 


He 


His 


Gin 


Phe 


Gly Val 




130 










135 










140 










Ala 


Gly 


Leu 


Gly 


Gly Ala 


Gly 


Phe 


Pro 


Thr 


Gly Val 


Lys 


Leu 


Gin 


Gly 


145 










150 










155 










160 


Gly 


Gly Asp 


Lys 


He 


Glu 


Thr 


Leu 


He 


He 


Asn 


Ala 


Ala 


Glu 


Cys 


Glu 










165 










170 










175 




Pro 


Tyr 


He 


Thr 
180 


Ala 


Asp 


Asp 


Arg 


Leu 
185 


Met 


Gin 


Asp 


Cys 


Ala 
190 


Ala 


Gin 


Val 


Val 


Glu 
195 


Gly 


He 


Arg 


He 


Leu 
200 


Ala 


His 


He 


Leu 


Gin 
205 


Pro 


Arg 


Glu 


He 


Leu 
210 


He 


Gly 


He 


Glu 


Asp 
215 


Asn 


Lys 


Pro 


Gin 


Ala 
220 


He 


Ser 


Met 


Leu 


Arg Ala 


Val 


Leu 


Ala 


Asp 


Ser 


Asn Asp 


He 


Ser 


Leu 


Arg 


Val 


He 


Pro 


225 










230 










235 










240 


Thr 


Lys 


Tyr 


Pro 


Ser 
245 


Gly 


Gly 


Ala 


Lys 


Gin 
250 


Leu 


Thr 


Tyr 


He 


Leu 
255 


Thr 


Gly 


Lys 


Gin 


Val 


Pro 


His 


Gly 


Gly Arg 


Ser 


Ser 


Asp 


He 


Gly Val 


Leu 








260 










265 










270 






Met 


Gin 


Asn 


Val 


Gly Thr Ala 


Tyr 


Ala 


Val 


Lys 


Arg Ala 


Val 


He 


Asp 






275 










280 










285 








Gly 


Glu 
290 


Pro 


He 


Thr 


Glu 


Arg 
295 


Val 


Val 


Thr 


Leu 


Thr 
300 


Gly 


Glu 


Ala 


He 
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Ala Arg 
305 

Leu Leu 

Met Gly 

Val Val 

Gly Glu 
370 
Asp Ala 
385 

Lys Gly 

He Glu 

Val Gin 

Glu Glu 
450 
Ala Arg 
465 

Ala Ala 

Leu Ala 

lie Lys 

Glu Ala 
530 
Thr Asn 
545 

Ala Ala 

Asn Ala 

Ala Ala 

Asn Ala 
610 
Ala Ala 
625 

Asn Ala 

Ala Ala 

Asn Ala 

Ala Ala 
690 
Asn Ala 
705 

Ala Ala 
Val Asn 



Pro Gly 

Asn Asp 

Gly Pro 
340 
Lys He 
355 

Pro Gin 

Cys Pro 

Gin Gin 

Cys Gly 
420 
Tyr Phe 
435 

Lys Arg 

Leu Glu 

Val Gin 

Arg Val 
500 
Ala Gly 
515 

Arg Lys 

Asp Ala 

He Ala 

Glu Pro 
580 
He Ala 
595 

Glu Pro 

He Ala 

Glu Pro 

He Ala 
660 
Glu Pro 
675 

He Ala 

Val Pro 

He Ala 

Glu Asp 
740 



Asn Val 
310 
Ala Gly 
325 

Leu Met 

Thr Asn 

Glu Glu 

Ala Asp 
390 
His Asp 
405 

Ala Cys 

Arg Gin 

Ala Ala 

Arg Glu 
470 
Pro Ala 
485 

Lys Glu 

Glu Arg 

Ala Gin 

Ala Thr 
550 
Arg Ala 
565 

Glu Gin 

Arg Ala 

Glu Glu 

Arg Ala 
630 
Glu Gin 
645 

Arg Ala 

Glu Glu 

Arg Ala 

Glu Glu 
710 
Arg Ala 
725 



Trp Ala Arg Leu 
Phe Cys 
Gly Phe 



Cys Leu 
360 
Gin Ser 
375 

Leu Leu 

Lys Ala 

Ala Trp 

Glu Lys 
440 
Glu Ala 
455 

Lys Ala 

Ala Lys 

Lys Gin 

Pro Asp 
520 
Ala Arg 
535 

Val Ala 

Lys Ala 

Gin Val 

Lys Ala 
600 
Gin Val 
615 

Lys Ala 

Gin Val 

Lys Ala 

Gin Val 
680 
Lys Ala 
695 

Gin Val 
Gin Ala 



Pro Ser 
330 
Thr Leu 
345 

Leu Ala 



Cys He 

Pro Gin 

Thr Thr 
410 
Val Cys 
425 

Ala Glu 

Lys Ala 

Ala Arg 

Asp Lys 
490 
Ala Gin 
505 

Asn Ser 

Ala Lys 

Asp Pro 

Arg Lys 
570 
Asp Pro 
585 

Arg Lys 

Asp Pro 

Arg Lys 

Asp Pro 
650 
Arg Lys 
665 

Asp Pro 

Arg Lys 

Asp Pro 

Lys Lys 
730 



Gly Thr 
315 

Ala Asp 

Pro Trp 

Pro Ser 

Arg Cys 
380 
Gin Leu 
395 

His Asn 

Pro Ser 

He Ala 

Arg Phe 
460 
Leu Glu 
475 

Asp Ala 

Ala Thr 

Ala He 

Gin Ala 
540 
Arg Lys 
555 

Leu Glu 

Arg Lys 

Leu Glu 

Arg Lys 
620 
Leu Glu 
635 

Arg Lys 

Arg Glu 

Arg Lys 

Leu Glu 
700 
Arg Lys 
715 

Ala Ala 



Pro Val 

Gin Met 

Leu Asp 
350 
Ala Asn 
365 

Ser Ala 

Tyr Trp 

He Ala 

Asn He 
430 
Ala He 
445 

Glu Ala 

Arg His 

He Ala 

Gin Pro 
510 
He Ala 
525 

Glu Leu 

Thr Ala 

Gin Gin 

Ala Ala 
590 
Gin Gin 
605 

Ala Ala 

Gin Gin 

Ala Ala 

Gin Gin 
670 
Ala Ala 
685 

Gin Gin 
Ala Ala 
Gin Gin 



Arg His 
320 
Val He 
335 

Val Pro 

Glu Leu 

Cys Ala 

Phe Ser 
400 

Asp Cys 
415 

Pro Leu 

Arg Gin 

Arg Gin 

Lys Ser 
480 
Ala Ala 
495 

He Val 

Ala Arg 

Gin Gin 

Val Glu 
560 
Gin Ala 
575 

Val Glu 

Gin Ala 

Val Glu 

Gin Ala 
640 
Val Glu 
655 

Pro Ala 

Val Glu 

Gin Ala 

Val Ala 
720 
Lys Val 
735 
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<210> 293 
<211> 352 
<212> PRT 
<213> E. Coli 



<400> 293 
Met Val Phe Arg 
1 

Thr Ser Arg lie 
20 

Ala Ala Gin Leu 
35 

Leu Ala Ser Val 
50 

Arg Lys Gin Ser 
65 

Thr Gly Leu Leu 

Met Val Val Leu 
100 

Tyr Gly Gly Leu 
115 

Val Val Leu Leu 
130 

Pro His Glu lie 
145 

Val He Phe Ser 

Arg Leu Gly He 
180 

Lys Thr Ser Val 
195 

Pro He Tyr Ser 
210 

Leu Ala Trp Leu 
225 

Arg Trp His He 

Met Leu Gly Trp 
260 

His Leu Leu Ser 
275 

Asp Pro Val Thr 
290 

Ala Leu Ala Gly 
305 

Pro Asp Gly Val 

Leu He Asp Tyr 
340 



He Ala Ser Ser 
5 

Met Leu Leu Val 

Trp Phe Phe Gly 
40 

Ser Ala Leu Leu 
55 

Val Ala Ala Thr 
70 

Leu Ala Val Ser 
85 

Gly Thr Val Phe 

Gly Gin Asn Pro 
120 

He Ser Phe Pro 
135 

Ala Val Asn He 
150 

Gly His Thr Ala 
165 

Asp Gly He Ser 

Arg Ala Gly His 
200 

Gly He Leu Ala 
215 

Ala Gly Gly Val 
230 

Pro Leu Ser Phe 
245 

Leu Phe Ser Pro 

Gly Ala Thr Met 
280 

Ala Ser Thr Thr 
295 

Leu Leu Val Trp 
310 

Ala Phe Ala Val 
325 

Tyr Thr Arg Pro 



Pro Tyr Thr His 
10 

Leu Leu Ala Ala 
25 

Trp Gly Thr Leu 

Ala Glu Ala Leu 
60 

Leu Lys Asp Asn 
75 

He Pro Pro Leu 
90 

Ala Val He He 
105 

Phe Asn Pro Ala 

Val Gin Met Thr 
140 

Pro Gly Phe He 
155 

Ser Gly Gly Asp 
170 

Gin Ala Thr Pro 

185 

Ser Val Glu Gin 

Gly Ala Gly Trp 
220 

Trp Leu Leu Trp 
235 

Leu Val Thr Leu 
250 

Glu Thr Leu Ala 
265 

Leu Gly Ala Phe 

Asn Arg Gly Arg 
300 

Leu He Arg Ser 
315 

Leu Leu Ala Asn 
330 

Arg Val Tyr Gly 
345 



Asn Gin Arg Gin 
15 

Val Pro Gly He 
30 

Val Gin He Leu 
45 

Val Leu Lys Leu 

Ser Ala Leu Leu 

80 

Ala Pro Trp Trp 
95 

Ala Lys Gin Leu 
110 

Met He Gly Tyr 
125 

Ser Trp Leu Pro 

Asp Ala He Gin 
160 

Met Asn Thr Leu 
175 

Leu Asp Thr Phe 
190 

He Met Gin Tyr 
205 

Gin Trp Val Asn 

Gin Lys Ala He 
240 

Ala Leu Cys Ala 
255 

Ala Pro Gin He 
270 

Phe He Leu Thr 
285 

Leu He Phe Gly 

Phe Gly Gly Tyr 
320 

He Thr Val Pro 
335 

His Arg Lys Gly 
350 



<210> 294 
<211> 206 
<212> PRT 



-137- 



<213> E. Coli 



<400> 294 
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His 
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Phe 


Ala 


Gly 


Lys 


Lys 


He 


Ser 


Gly 


Ala 


Asp 


Asp 


Ala 


His 


Trp 
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Val 


14 0 










1 0 u 
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1 7 R 
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<400> 


295 




























Ser 


Glu 


He 


Lys 


rlop 


V dX 


X Xtr 


V d X 


Gin 


Gly 


Leu 
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Lys 


A QTi 
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5 










10 










1 S 
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Ala 


Leu 


Val 
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Thr 


Ala 


Thr 
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Leu 
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Leu 


Gly 


Leu 


fix d 


1 XiX 


1 IIX 


lieu 


1 

V dX 






35 










40 










45 








Lsu 


Thr 


Leu 


Thr 
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Leu 


1 XIX 


T 1 o 
X X t3 


Q -K- 
OSX 
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irp 


Th T- 
1 IIX 


IT X VJ 




50 
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60 












Glu 


He 


Arg 


He 


Jr X CJ 


X X c! 


lyr 


V d X 


Met 


He 


J. X 


Al 3 
rlX d 


OCX 


Vp^ 1 
V d X 


Vpi 1 
V dx 


65 










70 










75 










80 


Ser 


Ala 


Val 


Gin 


Met 


Leu 


He 


Asn 


Ala 


Tyr 


Ala 


Phe 


Gly 


Leu 


Tyr 


Gin 










85 










90 










95 




Ser 


Leu 


Gly 


He 


Phe 


He 


Pro 


Leu 


He 


Val 


Thr 


Asn 


Cys 


He 


Val 


Val 








100 










105 










110 






Gly 


Arg 


Ala 


Glu 


Ala 


Phe 


Ala 


Ala 


Lys 


Lys 


Gly 


Pro 


Ala 


Leu 


Ser 


Ala 






115 










120 










125 








Leu 


Asp 


Gly 


Phe 


Ser 


He 


Gly 


Met 


Gly 


Ala 


Thr 


Cys 


Ala 


Met 


Phe 


Val 




130 










135 










140 










Leu 


Gly 


Ser 


Leu 


Arg 


Glu 


He 


He 


Gly 


Asn 


Gly 


Thr 


Leu 


Phe 


Asp 


Gly 



145 150 155 160 
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Ala Asp Ala Leu Leu Gly Ser Trp 
165 

Phe His Thr Asp Ser Pro Phe Leu 
180 

Phe lie Gly Leu Gly Leu Met Leu 
195 200 
Arg Met Lys Lys Arg Arg Ala Glu 

210 215 
Asn Gly Glu Thr Gly Asn Val 
225 230 



Ala Lys Val Leu Arg Val Glu lie 

170 175 
Leu Ala Met Leu Pro Pro Gly Ala 
185 190 
Ala Gly Lys Tyr Leu lie Asp Glu 
205 

Ala Ala Ala Glu Arg Ala Leu Pro 
220 



<210> 296 
<211> 211 
<212> PRT 
<213> E. Coli 
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Glu 
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Ser 
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Glu 
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Val 
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Glu 
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Gly Val 
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He 


Gly 
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195 200 205 



Val Asp He 
210 



<210> 297 
<211> 167 
<212> PRT 
<213> E. Coli 

<400> 297 

Met Lys Arg Leu His Lys Arg Phe Leu Leu Ala Thr Phe Cys Ala Leu 

15 10 15 

Phe Thr Ala Thr Leu Gin Ala Ala Asp Val Thr He Thr Val Asn Gly 



20 



25 



30 



Arg Val Val Ala Lys Pro Cys Thr He Gin Thr Lys Glu Ala Asn Val 

35 40 45 

Asn Leu Gly Asp Leu Tyr Thr Arg Asn Leu Gin Gin Pro Gly Ser Ala 

50 55 60 

Ser Gly Trp His Asn He Thr Leu Ser Leu Thr Asp Cys Pro Val Glu 
65 70 75 80 

Thr Ser Ala Val Thr Ala He Val Thr Gly Ser Thr Asp Asn Thr Gly 

85 90 95 

Tyr Tyr Lys Asn Glu Gly Thr Ala Glu Asn He Gin He Glu Leu Arg 

100 105 110 

Asp Asp Gin Asp Ala Ala Leu Lys Asn Gly Asp Ser Lys Thr Val He 

115 120 125 

Val Asp Glu He Thr Arg Asn Ala Gin Phe Pro Leu Lys Ala Arg Ala 

130 135 140 

He Thr Val Asn Gly Asn Ala Ser Gin Gly Thr He Glu Ala Leu He 
145 150 155 160 

Asn Val He Tyr Thr Trp Gin 



<210> 298 
<211> 176 
<212> PRT 
<213> E. Coli 

<400> 298 

Met Lys Tyr Asn Asn He He Phe Leu Gly Leu Cys Leu Gly Leu Thr 

15 10 15 

Thr Tyr Ser Ala Leu Ser Ala Asp Ser Val He Lys He Ser Gly Arg 

20 25 30 

Val Leu Asp Tyr Gly Cys Thr Val Ser Ser Asp Ser Leu Asn Phe Thr 

35 40 45 

Val Asp Leu Gin Lys Asn Ser Ala Arg Gin Phe Pro Thr Thr Gly Ser 

50 55 60 

Thr Ser Pro Ala Val Pro Phe Gin He Thr Leu Ser Glu Cys Ser Lys 
65 70 75 80 

Gly Thr Thr Gly Val Arg Val Ala Phe Asn Gly He Glu Asp Ala Glu 

85 90 95 

Asn Asn Thr Leu Leu Lys Leu Asp Glu Gly Ser Asn Thr Ala Ser Gly 

100 105 110 

Leu Gly He Glu He Leu Asp Ala Asn Met Arg Pro Val Lys Leu Asn 

115 120 125 

Asp Leu His Ala Gly Met Gin Trp He Pro Leu Val Pro Glu Gin Asn 

130 135 140 

Asn He Leu Pro Tyr Ser Ala Arg Leu Lys Ser Thr Gin Lys Ser Val 
145 150 155 160 

Asn Pro Gly Leu Val Arg Ala Ser Ala Thr Phe Thr Leu Glu Phe Gin 



165 



165 



170 



175 



<210> 
<211> 
<212> 
<213> 



E. Coli 



299 
382 
PRT 
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<400> 299 
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Ser 


Ser 
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Asp 


Asn 
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265 










270 
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Val 


Val 
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lie 


Pro 
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His 


Gly Ala 
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280 










285 








He 


Ala 
290 


Arg 


Ala 


Thr 


Phe 


Asn 
295 


Ala 
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He 


Gly 
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Val 
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He 
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315 
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He 
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Val 
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Pro 


Gin 


Ser 
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Gin 
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Val 


Ser 


Trp 


Gly 
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345 










350 


Lys 


Asp 


Lys 
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Ser 
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He 
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Val 


Glu 
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<210> 300 
<211> 138 
<212> PRT 
<213> E. Coli 



<400> 300 
Met He Ala He 
1 

Val Ala Pro Phe 

20 

Ala Leu Glu Leu 
35 

Leu Leu Asp Leu 
50 

Pro Glu Phe Ala 
65 

He Ala Val He 

Asp Leu Pro Glu 
100 

Gly Lys Arg Lys 
115 

Phe Gly He Ser 
130 



Ala Asp He Leu 
5 

Leu Ala Gly He 

Val Asp His Leu 
40 

Val Cys Ala Lys 
55 

Glu Phe Asn Ala 
70 

Arg Thr Leu Met 
85 

He Gly Ser Lys 

Leu Thr Leu Glu 
120 

Pro Ala Leu Phe 
135 



Gin Ala Gly Glu 
10 

Gin Asn Glu Glu 
25 

Leu Leu Asn Asp 

He Thr Ala Trp 
60 

Met Ala Gin Ala 
75 

Asp Gin Tyr Gly 
90 

Ser Met Val Ser 
105 

His Ala Lys Lys 
He Asp 



Lys Leu Thr Ala 
15 

Gin Tyr Thr Gin 
30 

Pro Glu Asn Pro 
45 

Glu Glu Ser Ala 

Met Pro Gly Gly 
80 

Leu Thr Leu Ser 
95 

Arg Val Leu Ser 
110 

Leu Ala Thr Arg 
125 



<210> 301 
<211> 104 
<212> PRT 
<213> E. Coli 



<400> 301 
Met His Leu He 
1 

Pro Gin His Lys 
20 

Gly Tyr Phe Lys 
35 

Asp Asn Phe Lys 
50 

Asn Glu Leu Arg 
65 

Tyr He Arg Glu 

Val His Arg Thr 
100 



Thr Gin Lys Ala 
5 

Thr Glu Leu Val 

Lys Pro Glu Ser 
40 

Tyr Leu Asp Lys 
55 

Val Val Ala Met 
70 

Val Met Thr His 
85 

Lys Gly Lys Lys 



Leu Lys Asp Ala 
10 

Ala Leu Gly Asn 
25 

Leu Lys Ala Val 

His Tyr Val Phe 
60 

Val Phe Phe Glu 
75 

Lys Glu Tyr Asp 
90 



Ala Glu Lys Tyr 
15 

Thr He Ala Lys 
30 

Phe Pro Ser Leu 
45 

Asn Val Gly Gly 

Ser Gin Lys Cys 
80 

Phe Phe Thr Ala 
95 



<210> 302 
<211> 2383 
<212> PRT 
<213> E. Coli 



<400> 302 
Met Leu Ser Val Phe Thr Phe Phe 

1 5 
Met Leu Ala Arg Ser Gly Lys Val 
20 

Gly Glu Glu He Asn Asp Arg Gin 

35 40 
Leu Arg Arg Leu Thr Ala Gly He 



Arg Cys Ala Arg Lys Gly Ala Phe 

10 15 

Ser Met Ala Thr Lys Lys Arg Ser 

25 30 

He Leu Cys Gly Met Gly He Lys 
45 

Cys Leu He Thr Gin Leu Ala Phe 



-142- 



50 55 60 

Pro Met Ala Ala Ala Ala Gin Gly Val Val Asn Ala Ala Thr 
65 70 75 

Pro Val Pro Ala Gin He Ala He Ala Asn Ala Asn Thr Val 

85 90 
Thr Leu Gly Ala Leu Glu Ser Ala Gin Ser Val Ala Glu Arg 
100 105 110 

lie Ser Val Ala Glu Leu Arg Lys Leu Asn Gin Phe Arg Thr 

115 120 125 

Arg Ser Phe Asp Asn Val Arg Gin Gly Asp Glu Leu Asp Val 

130 135 140 

Gin Val Ser Glu Lys Lys Leu Thr Pro Pro Pro Gly Asn Ser 
145 150 155 

Asn Leu Glu Gin Gin He Ala Ser Thr Ser Gin Gin He Gly 

165 170 
Leu Ala Glu Asp Met Asn Ser Glu Gin Ala Ala Asn Met Ala 
180 185 190 

Trp Ala Ser Ser Gin Ala Ser Gly Ala Met Thr Asp Trp Leu 

195 200 205 

Phe Gly Thr Ala Arg He Thr Leu Gly Val Asp Glu Asp Phe 

210 215 220 

Lys Asn Ser Gin Phe Asp Phe Leu His Pro Trp Tyr Glu Thr 
225 230 235 

Asn Leu Phe Phe Ser Gin His Thr Leu His Arg Thr Asp Glu 

245 250 
Gin He Asn Asn Gly Leu Gly Trp Arg His Phe Thr Pro Thr 
260 265 270 

Ser Gly He Asn Phe Phe Phe Asp His Asp Leu Ser Arg Tyr 

275 280 285 

Arg Ala Gly He Gly Ala Glu Tyr Trp Arg Asp Tyr Leu Lys 

290 295 300 

Ser Asn Gly Tyr Leu Arg Leu Thr Asn Trp Arg Ser Ala Pro 
305 310 315 

Asp Asn Asp Tyr Glu Ala Arg Pro Ala Asn Gly Trp Asp Val 

325 330 
Glu Ser Trp Leu Pro Ala Trp Pro His Leu Gly Gly Lys Leu 
340 345 350 

Glu Gin Tyr Tyr Gly Asp Glu Val Ala Leu Phe Asp Lys Aso 

355 360 365 

Gin Ser Asn Pro His Ala He Thr Ala Gly Leu Asn Tyr Thr 

370 375 380 

Pro Leu Met Thr Phe Ser Ala Glu Gin Arg Gin Gly Lys Gin 
385 390 395 

Asn Asp Thr Arg Phe Ala Val Asp Phe Thr Trp Gin Pro Gly 

405 410 
Met Gin Lys Gin Leu Asp Pro Asn Glu Val Ala Ala Arg Arg 
420 425 430 

Ala Gly Ser Arg Tyr Asp Leu Val Asp Arg Asn Asn Asn He 

435 440 445 

Glu Tyr Arg Lys Lys Glu Leu Val Arg Leu Thr Leu Thr Asp 

450 455 460 

Thr Gly Lys Ser Gly Glu Val Lys Ser Leu Val Ser Ser Leu 
465 470 475 

Lys Tyr Ala Leu Lys Gly Tyr Asn Val Glu Ala Thr Ala Leu 

485 490 
Ala Gly Gly Lys Val Val Thr Thr Gly Lys Asp He Leu Val 
500 505 510 



Gin Gin 

80 
Pro Tyr 
95 

Phe Gly 

Phe Ala 

Pro Ala 

Ser Asp 
160 
Ser Leu 
175 

Arg Gly 

Ser Arg 

Ser Leu 

Pro Asp 
240 
Arg Thr 
255 

Trp Met 

His Ser 

Leu Ser 

Glu Leu 
320 
Arg Ala 
335 

Val Tyr 

Asp Arg 

Pro Phe 

Gly Glu 
400 
Ser Ala 
415 

Ser Leu 

Val Leu 

Pro Val 

Gin Thr 
480 
Glu Ala 
495 

Thr Leu 
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Pro Ala 

He Glu 
530 
Gin Ser 
545 

Ser Val 

Ala Thr 

Gly Leu 

Ser Asp 
610 
Thr Gly 
625 

Val Asp 

Ser Ser 

Ser Gly 

Lys Pro 
690 
Asp Asn 
705 

Gly Val 

Thr Ala 

Gly Phe 

Ser Ala 
770 
Val Ser 
785 

Thr Val 

Gin Asn 

Lys Ser 

Gly Val 
850 
Ala Gin 
865 

Asn Asp 

Leu Leu 

Lys Leu 

Thr Leu 
930 
Ser Ser 
945 

Ser Thr 



Tyr Arg Phe 
515 

Val Thr Ala 

Met Val Val 

Ser Leu Ser 
565 

Leu Thr Phe 
580 

Val Leu Ser 
595 

Trp Lys Asp 

Ala Met Ser 

Ala Ala Lys 
645 

Arg Thr His 
660 

Asn Pro He 
675 

Val Lys Glu 

Val Lys Pro 

Tyr Lys Ala 
725 

Lys Leu Leu 

740 
He He Asp 
755 

Ser Asn Asn 

Val Asn Val 

Thr Phe Ala 
805 

Thr Ala Lys 
820 

Ser Lys Gin 
835 

Lys Gin Thr 

Val Asp Leu 

Ser Val Thr 
885 

Asn Asp Val 
900 

Ser Gin Thr 
915 

Thr Ser Leu 
Gly Ser Gin 
Ala Ala Leu 



Thr Ser Thr Pro Glu Thr Asp Asn Thr Trp Pro 

520 525 
Glu Asp Val Lys Gly Asn Leu Ser Asn Arg Glu 

535 540 
Val Gin Ala Pro Thr Leu Ser Gin Lys Asp Ser 
550 555 560 

Thr Gin Thr Leu Asn Ala Asp Ser His Ser Thr 

570 575 
He Ala His Asp Ala Ala Gly Asn Pro Val Val 

585 590 
Thr Arg His Glu Gly Val Gin Asp He Thr Leu 

600 605 
Asn Gly Asp Gly Ser Tyr Thr Gin He Leu Thr 

615 620 
Gly Thr Leu Thr Leu Met Pro Gin Leu Asn Gly 
630 635 640 

Ala Pro Ala Val Val Asn He He Ser Val Ser 

650 655 
Ser Ser He Lys He Asp Lys Asp Arg Tyr Leu 

665 670 
Glu Val Thr Val Glu Leu Arg Asp Glu Asn Asp 

680 685 
Gin Lys Gin Gin Leu Asn Asn Ala Val Ser He 

695 700 
Gly Val Thr Thr Asp Trp Lys Glu Thr Ala Asp 
710 715 720 

Thr Tyr Thr Ala Tyr Thr Lys Gly Ser Gly Leu 

730 735 
Met Gin Asn Trp Asn Glu Asp Leu His Thr Ala 

745 750 
Ala Asn Pro Gin Ser Ala Lys He Ala Thr Leu 

760 765 
Gly Val Leu Ala Asn Glu Asn Ala Ala Asn Thr 

775 780 
Ala Asp Glu Gly Ser Asn Pro He Asn Asp His 
7^0 795 800 

Val Leu Ser Gly Ser Ala Thr Ser Phe Asn Asn 

810 815 
Thr Asp Val Asn Gly Leu Ala Thr Phe Asp Leu 

825 830 
Glu Asp Asn Thr Val Glu Val Thr Leu Glu Asn 

840 845 
Leu He Val Ser Phe Val Gly Asp Ser Ser Thr 

855 860 
Gin Lys Ser Lys Asn Glu Val Val Ala Asp Gly 
870 875 880 

Met Thr Ala Thr Val Arg Asp Ala Lys Gly Asn 

890 895 
Met Val Thr Phe Asn Val Asn Ser Ala Glu Ala 

905 910 
Glu Val Asn Ser His Asp Gly He Ala Thr Ala 

920 925 
Lys Asn Gly Asp Tyr Arg Val Thr Ala Ser Val 

935 940 
Ala Asn Gin Gin Val Asn Phe He Gly Asp Gin 
550 955 960 

Thr Leu Ser Val Pro Ser Gly Asp He Thr Val 
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965 970 975 

Thr Asn Thr Ala Pro Gin Tyr Met Thr Ala Thr Leu Gin Asp Lys Asn 

980 985 990 

Gly Asn Pro Leu Lys Asp Lys Glu He Thr Phe Ser Val Pro Asn Asp 

1000 1005 
Val Ala Ser Lys Phe Ser He Ser Asn Gly Gly Lys Gly Met Thr Asp 

1010 1015 1020 

Ser Asn Gly Val Ala He Ala Ser Leu Thr Gly Thr Leu Ala Gly Thr 
1025 1030 1035 1040 

His Met He Met Ala Arg Leu Ala Asn Ser Asn Val Ser Asp Ala Gin 

1045 1050 1055 

Pro Met Thr Phe Val Ala Asp Lys Asp Arg Ala Val Val Val Leu Gin 

1060 1065 1070 

Thr Ser Lys Ala Glu He He Gly Asn Gly Val Asp Glu Thr Thr Leu 

1075 1080 1085 

Thr Ala Thr Val Lys Asp Pro Ser Asn His Pro Val Ala Gly He Thr 

1090 1095 1100 

Val Asn Phe Thr Met Pro Gin Asp Val Ala Ala Asn Phe Thr Leu Glu 
1105 1110 1115 1120 

Asn Asn Gly He Ala He Thr Gin Ala Asn Gly Glu Ala His Val Thr 

1125 1130 1135 

Leu Lys Gly Lys Lys Ala Gly Thr His Thr Val Thr Ala Thr Leu Gly 

1140 1145 1150 

Asn Asn Asn Thr Ser Asp Ser Gin Pro Val Thr Phe Val Ala Asp Lys 

1155 1160 1165 

Ala Ser Ala Gin Val Val Leu Gin He Ser Lys Asp Glu He Thr Gly 

1170 1175 1180 

Asn Gly Val Asp Ser Ala Thr Leu Thr Ala Thr Val Lys Asp Gin Phe 
1185 1190 1195 1200 

Asp Asn Glu Val Asn Asn Leu Pro Val Thr Phe Ser Ser Ala Ser Ser 

1205 1210 1215 

Gly Leu Thr Leu Thr Pro Gly Val Ser Asn Thr Asn Glu Ser Gly He 

1220 1225 1230 

Ala Gin Ala Thr Leu Ala Gly Val Ala Phe Gly Glu Lys Thr Val Thr 

1235 1240 1245 

Ala Ser Leu Ala Asn Asn Gly Ala Ser Asp Asn Lys Thr Val His Phe 

1250 1255 1260 

He Gly Asp Thr Ala Ala Ala Lys He He Glu Leu Ala Pro Val Pro 
1265 1270 1275 1280 

Asp Ser He He Ala Gly Thr Pro Gin Asn Ser Ser Gly Ser Val He 

1285 1290 1295 

Thr Ala Thr Val Val Asp Asn Asn Gly Phe Pro Val Lys Gly Val Thr 

1300 1305 1310 

Val Asn Phe Thr Ser Asn Ala Ala Thr Ala Glu Met Thr Asn Gly Gly 

1315 1320 1325 

Gin Ala Val Thr Asn Glu Gin Gly Lys Ala Thr Val Thr Tyr Thr Asn 

1330 1335 1340 

Thr Arg Ser Ser He Glu Ser Gly Ala Arg Pro Asp Thr Val Glu Ala 
1345 1350 1355 1360 

Ser Leu Glu Asn Gly Ser Ser Thr Leu Ser Thr Ser He Asn Val Asn 

1365 1370 1375 

Ala Asp Ala Ser Thr Ala His Leu Thr Leu Leu Gin Ala Leu Phe Asp 

1380 1385 1390 

Thr Val Ser Ala Gly Glu Thr Thr Ser Leu Tyr He Glu Val Lys Asp 

1395 1400 1405 

Asn Tyr Gly Asn Gly Val Pro Gin Gin Glu Val Thr Leu Ser Val Ser 
1410 1415 1420 
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Pro Ser Glu Gly Val Thr Pro Ser Asn Asn Ala He Tyr Thr Thr Asn 
^425 1430 1435 1440 

His Asp Gly Asn Phe Tyr Ala Ser Phe Thr Ala Thr Lys Ala Gly Val 

1445 1450 1455 

Tyr Gin Leu Thr Ala Thr Leu Glu Asn Gly Asp Ser Met Gin Gin Thr 

1460 1465 1470 

Val Thr Tyr Val Pro Asn Val Ala Asn Ala Glu He Thr Leu Ala Ala 

1475 1480 1485 

Ser Lys Asp Pro Val He Ala Asp Asn Asn Asp Leu Thr Thr Leu Thr 

1490 1495 1500 

Ala Thr Val Ala Asp Thr Glu Gly Asn Ala He Ala Asn Thr Glu Val 
1505 1510 1515 1520 

Thr Phe Thr Leu Pro Glu Asp Val Lys Ala Asn Phe Thr Leu Ser Asp 

1525 1530 1535 

Gly Gly Lys Val He Thr Asp Ala Glu Gly Lys Ala Lys Val Thr Leu 

1540 1545 1550 

Lys Gly Thr Lys Ala Gly Ala His Thr Val Thr Ala Ser Met Thr Gly 

1555 1560 1565 

Gly Lys Ser Glu Gin Leu Val Val Asn Phe He Ala Asp Thr Leu Thr 

1570 1575 1580 

Ala Gin Val Asn Leu Asn Val Thr Glu Asp Asn Phe He Ala Asn Asn 
1585 1590 1595 1600 

Val Gly Met Thr Arg Leu Gin Ala Thr Val Thr Asp Gly Asn Gly Asn 

1605 1610 1615 

Pro Leu Ala Asn Glu Ala Val Thr Phe Thr Leu Pro Ala Asp Val Ser 

1620 1625 1630 

Ala Ser Phe Thr Leu Gly Gin Gly Gly Ser Ala He Thr Asp He Asn 

1635 1640 1645 

Gly Lys Ala Glu Val Thr Leu Ser Gly Thr Lys Ser Gly Thr Tyr Pro 

1650 1655 1660 

Val Thr Val Ser Val Asn Asn Tyr Gly Val Ser Asp Thr Lys Gin Val 
1665 1670 1675 1680 

Thr Leu He Ala Asp Ala Gly Thr Ala Lys Leu Ala Ser Leu Thr Ser 

1685 1690 1695 

Val Tyr Ser Phe Val Val Ser Thr Thr Glu Gly Ala Thr Met Thr Ala 

1700 1705 1710 

Ser Val Thr Asp Ala Asn Gly Asn Pro Val Glu Gly He Lys Val Asn 

1715 1720 1725 

Phe Arg Gly Thr Ser Val Thr Leu Ser Ser Thr Ser Val Glu Thr Asp 

1730 1735 1740 

Asp Arg Gly Phe Ala Glu He Leu Val Thr Ser Thr Glu Val Gly Leu 
1745 1750 1755 1760 

Lys Thr Val Ser Ala Ser Leu Ala Asp Lys Pro Thr Glu Val He Ser 

1765 1770 1775 

Arg Leu Leu Asn Ala Ser Ala Asp Val Asn Ser Ala Thr He Thr Ser 

1780 1785 1790 

Leu Glu He Pro Glu Gly Gin Val Met Val Ala Gin Asp Val Ala Val 

1795 1800 1805 

Lys Ala His Val Asn Asp Gin Phe Gly Asn Pro Val Ala His Gin Pro 

1810 1815 1820 

Val Thr Phe Ser Ala Glu Pro Ser Ser Gin Met He He Ser Gin Asn 
1825 1830 1835 1840 

Thr Val Ser Thr Asn Thr Gin Gly Val Ala Glu Val Thr Met Thr Pro 

1845 1850 1855 

Glu Arg Asn Gly Ser Tyr Met Val Lys Ala Ser Leu Pro Asn Gly Ala 

I860 1865 1870 

Ser Leu Glu Lys Gin Leu Glu Ala He Asp Glu Lys Leu Thr Leu Thr 
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1875 1880 1885 

Ala Ser Ser Pro Leu He Gly Val Tyr Ala Pro Thr Gly Ala Thr Leu 

1890 1895 1900 

Thr Ala Thr Leu Thr Ser Ala Asn Gly Thr Pro Val Glu Gly Gin Val 
1^05 1910 1915 1920 

He Asn Phe Ser Val Thr Pro Glu Gly Ala Thr Leu Ser Gly Gly Lys 

1925 ^ 1930 1935 

Val Arg Thr Asn Ser Ser Gly Gin Ala Pro Val Val Leu Thr Ser Asn 

1940 1945 1950 

Lys Val Gly Thr Tyr Thr Val Thr Ala Ser Phe His Asn Gly Val Thr 

1955 I960 1965 

He Gin Thr Gin Thr Thr Val Lys Val Thr Gly Asn Ser Ser Thr Ala 

1970 1975 1980 

His Val Ala Ser Phe He Ala Asp Pro Ser Thr He Ala Ala Thr Asn 
1985 1990 1995 2000 

Thr Asp Leu Ser Thr Leu Lys Ala Thr Val Glu Asp Gly Ser Gly Asn 

2005 2010 2015 

Leu He Glu Gly Leu Thr Val Tyr Phe Ala Leu Lys Ser Gly Ser Ala 

2020 2025 2030 

Thr Leu Thr Ser Leu Thr Ala Val Thr Asp Gin Asn Gly He Ala Thr 

2035 2040 2045 

Thr Ser Val Lys Gly Ala Met Thr Gly Ser Val Thr Val Ser Ala Val 

2050 2055 2060 

Thr Thr Ala Gly Gly Met Gin Thr Val Asp He Thr Leu Val Ala Gly 
2065 2070 2075 2080 

Pro Ala Asp Thr Ser Gin Ser Val Leu Lys Ser Asn Arg Ser Ser Leu 

2085 2090 2095 

Lys Gly Asp Tyr Thr Asp Ser Ala Glu Leu Arg Leu Val Leu His Asp 

2100 2105 2110 

He Ser Gly Asn Pro He Lys Val Ser Glu Gly Met Glu Phe Val Gin 

2115 2120 2125 

Ser Gly Thr Asn Val Pro Tyr He Lys He Ser Ala He Asp Tyr Ser 

2130 2135 2140 

Leu Asn He Asn Gly Asp Tyr Lys Ala Thr Val Thr Gly Gly Gly Glu 
2145 2150 2155 2160 

Gly He Ala Thr Leu He Pro Val Leu Asn Gly Val His Gin Ala Gly 

2165 2170 2175 

Leu Ser Thr Thr He Gin Phe Thr Arg Ala Glu Asp Lys lie Met Ser 

2180 2185 2190 

Gly Thr Val Ser Val Asn Gly Thr Asp Leu Pro Thr Thr Thr Phe Pro 

2195 2200 2205 

Ser Gin Gly Phe Thr Gly Ala Tyr Tyr Gin Leu Asn Asn Asp Asn Phe 

2210 2215 2220 

Ala Pro Gly Lys Thr Ala Ala Asp Tyr Glu Phe Ser Ser Ser Ala Ser 
2225 2230 2235 2240 

Trp Val Asp Val Asp Ala Thr Gly Lys Val Thr Phe Lys Asn Val Gly 

2245 2250 2255 

Ser Asn Ser Glu Arg He Thr Ala Thr Pro Lys Ser Gly Gly Pro Ser 

2260 2265 2270 

Tyr Val Tyr Glu He Arg Val Lys Ser Trp Trp Val Asn Ala Gly Glu 

2275 2280 2285 

Ala Phe Met He Tyr Ser Leu Ala Glu Asn Phe Cys Ser Ser Asn Glv 

2290 2295 2300 

Tyr Thr Leu Pro Arg Ala Asn Tyr Leu Asn His Cys Ser Ser Arq Gly 
2305 2310 2315 2320 

He Gly Ser Leu Tyr Ser Glu Trp Gly Asp Met Gly His Tyr Thr Thr 
2325 2330 2335 
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Asp Ala Giy Phe Gin Ser Asn Met Tyr Trp Ser Ser Ser Pro Ala Asn 

2340 2345 2350 

Ser Ser Glu Gin Tyr Val Val Ser Leu Ala Thr Gly Asp Gin Ser Val 

2355 2360 2365 

Phe Glu Lys Leu Gly Phe Ala Tyr Ala Thr Cys Tyr Lys Asn Leu 
2370 2375 2380 



<210> 303 
<211> 61 
<212> PRT 
<213> E. Coll 



<400> 303 
Met Ser Lys Gly Ala Leu Tyr Glu 

1 5 
He Pro Leu Pro His Lys His He 
20 

Ser Lys Asp Val Gly Tyr Ala Tyr 

35 40 
Leu Lys Thr His Ser Leu Arg Leu 
50 55 



Phe Asn Asn Pro Asp Gin Leu Lys 

10 15 
Ala Ser Thr Phe Asn Asp He Met 
25 30 
Val Ser Leu Leu Tyr Ala Cys Pro 
45 

Asn Pro Phe Ser Lys 
60 



<210> 304 
<211> 398 
<212> PRT 
<213> E. Coli 



<400> 304 
Met Gin Val Ala 
1 

Ala Val Ala Thr 
20 

Glu Arg Gin Lys 
35 

Asn Asp Pro Ala 
50 

Phe Gly Leu Thr 
65 

Ala Glu Val Thr 

Arg Glu Gin Thr 
100 

Trp Glu Trp Gin 
115 

Glu Lys Glu Gin 
130 

Asn Gly He Thr 
145 

Ser Lys Thr Arg 

He Glu Ala Arg 
180 

Lys Leu Lys Pro 
195 

Glu Leu Gly Tyr 



Glu Gin Arg He 
5 

Gin Asp Gly Pro 

Met Ser Ala Glu 
40 

Ala Gly Thr Thr 
55 

Ala Gly Trp His 
70 

Ala Arg Leu Gly 
85 

Arg Gin Leu Leu 

Thr Gin Ala Ala 
120 

Asn Thr He He 
135 

Ser Ser Val Glu 
150 

Gin Gin Leu Asn 
165 

Leu Ser Ala Leu 

Val Ala Leu Pro 
200 

Ser Leu Leu Ala 



Gin Leu Ala Glu 
10 

Gin He Asp Phe 
25 

Gly Leu Met Gly 

Gly Pro Trp Tyr 
60 

Leu Asp He Trp 
75 

Thr Val Lys Ala 
90 

Ala Gly Ser Val 
105 

Leu Asn Thr Val 

Ala Thr Asp Arg 
140 

Gly Val Glu Thr 
155 

Asp Val Ala Gly 
170 

Thr Asn Asn Gin 

185 

Lys Val Ala Ser 
Arg Arg Ala Asp 



Ala Gin Ala Lys 
15 

Ser Ala Asp Met 
30 

Pro Phe Ala Leu 
45 

Thr Asn Gly Thr 

Gly Lys Asn Arg 
80 

Arg Ala Ala Glu 
95 

Ala Arg Leu Tyr 
110 

Leu Gin Gin He 
125 

Gin Leu Tyr Gin 

Asp He Asn Ala 
160 

Lys Met Lys He 
175 

Thr Lys Ser Leu 
190 

Gin Leu Pro Asp 
205 

Leu Gin Ala Ala 
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210 










215 










220 










His 


Trp 


Tyr 


Val 


Glu 


Ser 


Ser 


Leu 


Ser 


Thr 


He 


Asp 


Ala 


Ala 


Lys 


Ala 


225 










230 










235 








240 


Ala 


Phe 


Tyr 


Pro 


Asp 


He 


Asn 


Leu 


Met 


Ala 


Phe 


Leu 


Gin 


Gin Asp Ala 










245 










250 










255 




Leu 


His 


Leu 


Ser 


Asp 


Leu 


Phe 


Arg 


His 


Ser 


Ala 


Gin 


Gin 


Met 


Gly Val 








260 










265 










270 






Thr 


Ala 


Gly 


Leu 


Thr 


Leu 


Pro 


He 


Phe 


Asp 


Ser 


Gly Arg 


Leu 


Asn 


Ala 






275 










280 










285 








Asn 


Leu 


Asp 


He 


Ala 


Lvs 


Ala 


Glu 


Ser 


Asn 


Leu 


Ser 


i_Le 


Ala 


Ser 


Tyr 




290 










295 










300 








Asn 


Lys 


Ala 


Val 


Val 


Glu 


Ala 


Val 


Asn 


Asp 


Val 


Ala 


Arg 


Ala 


Ala 


Ser 


305 










310 










315 










320 


Gin 


Val 


Gin 


Thr 


Leu 


Ala 


Glu 


Lys 


Asn 


Gin 


His 


Gin 


Ala 


Gin 


He 


Glu 










325 










330 










335 




Arg 


Asp 


Ala 


Leu 


Arg 


Val 


Val 


Gly Leu 


Ala 


Gin 


Ala 


Arg 


Phe 


Asn 


Ala 








340 




















350 






Glv 


lie 


He 


Ala 


Gly 


Ser 


Arg 


va± 


Ser 




r\J- d 


Arg 


Tin 

lie 


Pro 


Ala 


Leu 






355 










ooU 










"D /T C 








Arg 


Glu 


Arg 


Ala 


Asn 


Gly 


Leu 


Leu 


Leu 


o_Ln 




taJLn 


Trp 


Leu Asp Ala 




370 










375 










380 










Ser 


He 


Gin 


Leu 


Thr 


Gly Ala 


Leu 


Gly 


o±y 


bxy 


Tyr 


Lys 


Arg 






385 










390 










395 
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305 




























<211> 


96 




























<212> 


PRT 




























<213> 


E. Coli 


























<400> 


305 


























Met 


Tyr 


Cys 


His 


Ala 


Lvs 


Leu 


Lys 


o 


T1 p 

_L J_ C 


o t; J_ 


Gin 


His 


Thr 


Val 


ire 


1 








5 










1 0 










15 




Ser 


Ala 


His 


Leu 


Phe 


Leu 


Pro 


i-iop 


lyr 


Ser 


Pro 


Met 


Asn 


Arg Asp 


Ser 








20 










25 










30 






Phe 


Tyr 


Pro 


Ala 


He 


Ala 


Cys 


Phe 


Pro 


Leu 


Leu 


Leu 


Met 


Leu 


Ala 


Gly 






35 










40 










45 






Cys 

J: 


Ala 


Pro 


Met 


His 


Glu 


Thr 


Arg 


Gin 


Ala 


T .f=ii n 


C d -K~ 

o er 


Gin 


Gin 


Thr 


Pro 




50 










55 










oU 










Ala 


Ala 


Gin 


Val 


Asp 


Thr 


Ala 


Leu 


Pro 


rrTU 
I LIL 


/i±a 


Leu 


Lys 


Met 


Val 


Gly 


65 










70 


















80 


Gin 


Thr 


Ala 


Asn 


Gly 


Gly 


Trp 


Ser 


He 


1 liX 


X re 


T 1 o 

± re 


Asn 


Ser 


Leu 


Pro 










85 










90 










95 






<210> 


306 




























<211> 


315 




























<212> 


PRT 




























<213> 


E. Coli 


























<400> 


306 


























Met 


Arg 


Val 


Leu 


Leu 


Ala 


Pro 


Met 


Glu 


Gly Val 


Leu 


Asp 


Ser 


Leu 


Val 


1 








5 










10 








15 




Arg 


Glu 


Leu 


Leu 


Thr 


Glu 


Val 


Asn 


Asp 


Tyr Asp 


Leu 


Cys 


He 


Thr 


Glu 








20 










25 










30 






Phe 


Val 


Arg 


Val 


Val 


Asp 


Gin 


Leu 


Leu 


Pro 


Val 


Lys 


Val 


Phe 


His 


Arg 






35 










40 










45 
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He Cys 

50 
Val Arg 
65 

Ala Ala 

Gly Cys 

Leu Lys 

Ala Val 
130 
Asp Ser 
145 

Gly Ala 

Arg Ala 

Leu Asn 

Ala Gin 
210 
Arg Gly 
225 

Glu Pro 

Thr Arg 

He Lys 

Glu Leu 
290 
Arg Ala 
305 



Pro Glu Leu 

Val Gin Leu 

Arg Ala Val 
85 

Pro Ser Lys 
100 

Asp Pro Glu 
115 

Pro Ala His 

Gly Glu Lys 

Thr Glu Leu 
165 

Glu His He 

180 
He Pro Val 
195 

Gin Cys Met 

Ala Leu Asn 

Arg Met Pro 
245 

Leu Glu Lys 
260 

Gin Trp Leu 
275 

Phe Gin His 
He Gin Ala 



Gin Asn 
55 

Leu Gly 
70 

Glu Leu 

Thr Val 

Leu He 

Leu Pro 
135 
Lys Phe 
150 

Val Val 

Asp Trp 

He Ala 

Ala He 
215 
He Pro 
230 

Trp Pro 

Gin Gly 

Ser Tyr 

Val Arg 
295 
He Asp 
310 



Ala Ser Arg Thr 
Gin Phe 
Gly Ser 



Asn Gly 
105 
Tyr Gin 
120 

Val Ser 

Glu He 

His Gly 

Gin Ala 
185 
Asn Gly 
200 

Ser Gly 

Asn Leu 

Glu Val 

Asp Thr 
265 
Leu Arg 
280 

Val Leu 
He Glu 



Pro Gin 

75 
Trp Gly 
90 

Ser Gly 



Gly Ala 

Val Lys 

Ala Asp 
155 
Arg Thr 
170 

He Gly 

Glu He 

Cys Asp 

Ser Arg 
235 
Val Ala 
250 

Gly Leu 

Lys Glu 

Asn Asn 

Lys Leu 
315 



Pro 

60 

Trp 

Val 

Gly 

Lys 

Val 
140 
Ala 

Lys 

Asp 

Trp 

Ala 
220 
Val 

Leu 

Tyr 

Tyr 

Ser 
300 



Ser Gly Thr Leu 
Leu Ala 
Asp Leu 



Gly Ala 
110 
Ala Met 
125 

Arg Leu 

Val Gin 

Glu Gin 

He Arg 
190 
Asp Trp 
205 

Val Met 

Val Lys 

Leu Gin 

His Val 
270 
Asp Glu 
285 

Pro Asp 



Glu Asn 
80 

Asn Cys 
95 

Thr Leu 



Arg Glu 

Gly Trp 

Gin Ala 
160 
Gly Tyr 
175 

Gin Arg 

Gin Ser 

He Gly 

Tyr Asn 
240 
Lys Tyr 
255 

Ala Arg 
Ala Thr 
He Ala 



Met 

1 
Phe 

Pro 

Pro 

Leu 

65 

He 

Arg 

Glu 



<210> 
<211> 
<212> 
<213> 

<400> 
Thr He 

Leu Arg 

Ala Pro 
35 

Asn Ser 
50 

Trp Ser 
Glu His 
Tyr Ser 
Leu Pro 



307 
296 
PRT 

E. Coli 



307 

Ser Thr 
5 

His Pro 
20 

Leu Arg 



Thr 
Asp 
Lys 
Phe He Asp 



Val Lys 

Gin Ser 

85 
He Ala 
100 

Leu Val 



Thr 

70 

Lys 

Ala 

Leu 



Ser Thr Pro 

Thr Ala Arg 
25 

Leu Cys Asp 
40 

Glu Asp Leu 
55 

Gin Glu Gly 

Pro Glu Glu 

Met Gin Asn 
105 

Pro Met Leu 



His Asp Ala 
10 

Asp Phe He 

Leu Thr Thr 

Arg Gin Tyr 
60 

Val Gly Tyr 
75 

Leu Met Ala 

90 

His Leu Asp 
Phe Tyr His 



Val Phe Lys 
15 

Asp He His 
30 

Leu Lys Leu 
45 

Tyr Ser Asp 

He Tyr Val 

Phe Arg Met 
95 

Ala Gly Tyr 
110 

Gly Cys Arg 



Ser 

Leu 

Glu 

Leu 

Val 

80 

Met 

Lys 

Ser 
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115 










120 










125 








Pro 


Tyr 


Pro 


Tyr 


Ser 


Leu 


Cys 


Trp 


Leu 


Asp 


Glu 


Phe 


Ala 


Glu 


Pro 


Ala 




130 










135 










140 










He 


Ala 


Arg 


Lys 


He 


Tyr 


Ser 


Ser 


Ala 


Phe 


Pro 


Leu 


Val 


Asp 


He 


-L i i J. 


145 










150 










155 






X D U 


Val 


Val 


Pro 


Asp 


Asp 


Glu 


He 


Met 


Gin 


His 


Arq 


Lvs 


Met 


Ala 


Leu 


Leu 










165 










170 
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Glu 


Leu 


He 


Gin 


Lys 


His 


He 


Arg 


Gin 


Arg 


Asp 


Leu 


Leu 


Gly 


Leu 


Val 


Asp 






180 










185 










190 






Gin 


He 


Val 


Ser 


Leu 


Leu 


Val 


Thr 


Gly 


Asn 


Thr 


Asn 


Asp 


Arg 


Gin 






195 










200 










205 




Leu 


Lys 


Ala 


Leu 


Phe 


Asn 


Tyr 


Val 


Leu 


Gin 


Thr 


Glv 
^ j_ _y 


Asp 


Ala 


Gin 


Arg 




210 










215 










220 








Phe 


Arg Ala 


Phe 


He 


Gly 


Glu 


He 


Ala 


Glu 


Arg 


Ala 


Pro 


Gin 


Glu 


Lys 


225 










230 










235 










240 


Glu 


Lys 


Leu 


Met 


Thr 


He 


Ala 


Asp 


Arg 


Leu 


Arg 


Glu 


Glu 


Gly 


Ala 


Met 










245 










250 








255 




Gin 


Gly Lys 


His 


Glu 


Glu 


Ala 


Leu 


Arg 


He 


Ala 


Gin 


Glu 


Met 


Leu 


Asp 








260 










265 










270 




Arg 


Gly 


Leu 


Asp 


Arg 


Glu 


Leu 


Val 


Met 


Met 


Val 


Thr 


Arg 


Leu 


Ser 


Pro 



275 280 285 

Asp Asp Leu He Ala Gin Ser His 



290 295 



<210> 308 
<211> 555 
<212> PRT 
<213> E. Coli 

<400> 308 



<400> 3 






























Met 


Ala 


Gin 


Phe 


Val 


Tyr 


Thr 


Met 


His 


Arg 


Val 


Gly Lys 


Val 


Val 


Pro 


1 








5 










10 










15 




Pro 


Lys 


Arg 


His 


He 


Leu 


Lys 


Asn 


He 


Ser 


Leu 


Ser 


Phe 


Phe 


Pro 


Gly 








20 










25 










30 




Ala 


Lys 


He 


Gly 


Val 


Leu 


Gly 


Leu 


Asn 


Gly Ala 


Gly 


Lys 


Ser 


Thr 


Leu 






35 










40 








45 








Leu 


Arg 


He 


Met 


Ala 


Gly 


He 


Asp 


Lys 


Asp 


He 


Glu 


Gly 


Glu 


Ala 


Arg 




50 










55 










60 








Pro 


Gin 


Pro 


Asp 


He 


Lys 


He 


Gly 


Tyr 


Leu 


Pro 


Gin 


Glu 


Pro 


Gin 


Leu 


65 










70 










75 










80 


Asn 


Pro 


Glu 


His 


Thr 


Val 


Arg 


Glu 


Ser 


He 


Glu 


Glu 


Ala 


Val 


Ser 


Glu 


Val 








85 










90 










95 




Val 


Asn 


Ala 


Leu 


Lys 


Arg 


Leu 


Asp 


Glu 


Val 


Tyr 


Ala 


Leu 


Tyr 


Ala 








100 










105 










110 




Asp 


Pro 


Asp 


Ala 


Asp 


Phe 


Asp 


Lys 


Leu 


Ala 


Ala 


Glu 


Gin 


Gly 


Arg 


Leu 


Glu 




115 










120 










125 




Glu 


He 


He 


Gin 


Ala 


His 


Asp 


Gly 


His 


Asn 


Leu 


Asn 


Val 


Gin 


Leu 




130 










135 










140 








Glu 


Arg 


Ala 


Ala 


Asp Ala 


Leu 


Arg 


Leu 


Pro 


Asp 


Trp Asp 


Ala 


Lys 


He 


145 










150 










155 








160 


Ala 


Asn 


Leu 


Ser 


Gly Gly Glu 


Arg 


Arg 


Arg Val 


Ala 


Leu 


Cys 


Arg 


Leu 


Leu 


Leu 


Glu 




165 










170 










175 




Lys 


Pro 


Asp 


Met 


Leu 


Leu 


Leu 


Asp 


Glu 


Pro 


Thr 


Asn 


His 


Leu 






180 










185 










190 






Asp 


Ala 


Glu 


Ser 


Val 


Ala 


Trp 


Leu 


Glu 


Arg 


Phe 


Leu 


His 


Asp 


Phe 
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195 

Glu Gly Thr 

210 
Val Ala Gly 
225 

Glu Gly Asn 

Gin Glu Ala 

Leu Glu Trp 
275 

Ala Arg Leu 

290 
Arg Asn Glu 
305 

Asp Lys Val 

Leu Leu lie 

Gly He He 
355 

He Ser Gly 
370 

Thr Val Lys 
385 

Ser Lys Thr 

He Gly Asn 

Phe Lys Gly 
435 

Glu Arg Gly 
450 

Met Leu Leu 
465 

Arg Ala Leu 

He Ser His 

Asp Tyr Gin 
515 

Glu Tyr Glu 

530 
Pro Lys Arg 
545 



Val Val 

Trp He 

Tyr Ser 
245 
Ser Gin 
260 

Val Arg 

Ala Arg 

Thr Asn 

Leu Glu 
325 
Asp Asp 
340 

Gly Pro 

Gin Glu 

Leu Ala 

Val Trp 
405 
Thr Glu 
420 

Val Asp 

Arg Leu 

Leu Asp 

Glu Asn 
485 
Asp Arg 
500 

Asp Glu 
Glu Tyr 
He Lys 



Ala He 
215 
Leu Glu 
230 

Ser Trp 

Glu Ala 

Gin Gly 

Phe Glu 
295 
Glu Leu 
310 

Val Ser 

Leu Ser 

Asn Gly 

Gin Pro 
375 

Ser Val 
390 

Glu Glu 

Met Pro 

Gin Gly 

His Leu 
455 
Glu Pro 
470 

Ala Leu 

Trp Phe 

Gly Lys 

Lys Lys 
535 
Tyr Lys 
550 



200 

Thr His Asp 

Leu Asp Arg 

Leu Glu Gin 
250 

Ala Arg Arg 

265 
Thr Lys Gly 
280 

Glu Leu Asn 

Phe He Pro 

Asn Leu Arg 
330 

Phe Ser He 

345 
Ala Gly Lys 
360 

Asp Ser Gly 

Asp Gin Phe 

Val Ser Gly 
410 

Ser Arg Ala 

425 
Lys Arg Val 
440 

Ala Lys Leu 

Thr Asn Asp 

Leu Glu Phe 
490 

Leu Asp Arg 

505 
Val Glu Phe 
520 

Arg Thr Leu 
Arg He Ala 



Arg Tyr 
220 
Gly Glu 
235 

Lys Asp 

Lys Ser 

Arg Gin 

Ser Thr 
300 
Pro Gly 
315 

Lys Ser 

Pro Lys 

Ser Thr 

Thr He 
380 
Arg Asp 
395 

Gly Leu 

Tyr Val 

Gly Glu 

Leu Gin 
460 
Leu Asp 
475 

Pro Gly 

He Ala 

Phe Glu 

Gly Ala 
540 

Lys 



205 

Phe Leu 

Gly He 

Gin Arg 

He Glu 
270 
Ser Lys 
285 

Glu Tyr 

Pro Arg 

Tyr Gly 

Gly Ala 
350 
Leu Phe 
365 

Thr Leu 

Ser Met 

Asp He 

Gly Arg 
430 
Leu Ser 
445 

Val Gly 

He Glu 

Cys Ala 

Thr His 
510 
Gly Asn 
525 

Asp Ala 



Asp Asn 

Pro Trp 
240 
Leu Ala 
255 

Lys Glu 

Gly Lys 

Gin Lys 

Leu Gly 
320 
Asp Arg 
335 

He Val 

Arg Met 

Gly Glu 

Asp Asn 
400 
Met Lys 
415 

Phe Asn 

Gly Gly 

Gly Asn 

Thr Leu 
480 
Met Val 
495 

He Leu 
Phe Thr 
Leu Glu 



<210> 309 
<211> 173 
<212> PRT 
<213> E. Coli 

<400> 309 
Met Ser Lys Pro Lys Tyr Pro Phe 

1 5 
His Tyr Phe Thr Thr Asp Asp Gly 



Glu Lys Arg Leu Glu Val Val Asn 

10 15 
Tyr Arg He He Ser Ala Arg Phe 



-152- 



20 

Gly Val Pro Arg 
35 

His Gly Glu Lys 
50 

Pro Glu Leu Arg 
65 

Ser Leu Asn Gin 

Val Ala Arg Trp 
100 

Arg Ala Leu Lys 
115 

Pro Glu Lys Ala 
130 

Glu Asp Leu Glu 
145 

Leu Lys Lys Leu 



Thr Gin Val Arg 
40 

Gly Leu lie Pro 
55 

He Lys Val Val 
70 

Ala Ala Ala His 
85 

Leu Lys Val Tyr 

He Gly Thr Lys 
120 

Ala Ser Ala Leu 
135 

Arg Gin Val Arg 
150 

Lys Ala Leu Ala 
165 



25 

Thr Trp Val Ala 

Lys Pro Lys Gly 
60 

Lys Ala Val He 
75 

Phe Met Leu Ala 
90 

Glu Glu Arg Gly 
105 

Arg Asn He Ala 

Glu Leu Ser Lys 
140 

Phe Leu Glu Thr 
155 

His Pro Thr Lys 
170 



30 

Leu Tyr Glu Lys 
45 

Val Ser Ala Asp 

Glu Gin His Met 
80 

Gly Ser Gly Ser 
95 

Glu Ala Gly Leu 
110 

He Ser Val Asp 
125 

Asp Arg Arg He 

Arg Leu Met Tyr 
160 

Lys 



<210> 310 
<211> 283 
<212> PRT 
<213> E. Coli 



<400> 310 
Met Lys Val Leu 
1 

Leu Arg Ala Ala 
20 

Ala Leu Ser Lys 
35 

Glu He Tyr His 
50 

Leu Ser Leu His 
65 

Arg Leu Met Gly 

Tyr Arg Ser Tyr 
100 

Gin Arg Asp Phe 
115 

Val Thr Glu Phe 
130 

He Asp Leu Phe 
145 

Pro Val Met Asn 

Leu Asn Pro His 
180 

Tyr Arg Met Arg 
195 

Gin Ser Met Ser 
210 

Cys Phe Phe Gly 
225 



Asn Glu Leu Arg 
5 

Glu He Pro Arg 

Pro Asp Lys Tyr 
40 

Glu Asn Arg Gly 
55 

Arg Glu Gly Lys 
70 

Thr Leu Ser Leu 
85 

Arg Gly Glu Val 

Lys Ala Thr Arg 
120 

Ala Val Asn Gly 
135 

Asn Asn Glu Val 
150 

Met Val Glu Asn 
165 

Glu His Pro Val 

Arg Tyr Gin Asn 
200 

Arg Lys Gly Asn 
215 

Thr Leu Lys Ser 
230 



Gin Phe Tyr Pro 
10 

Ser Thr Phe Tyr 
25 

Ala Asp Val Lys 

Arg Tyr Gly Tyr 
60 

Gin He Asn His 
75 

Lys Ala Ala He 
90 

Gly Gin Thr Ala 
105 

Pro Asn Glu Lys 

Arg Lys Leu Tyr 
140 

He Ser Tyr Ser 
155 

Met Leu Asp Gin 
170 

Leu His Ser Asp 
185 

He Leu Lys Glu 

Cys Leu Asp Asn 
220 

Glu Cys Phe Tyr 
235 



Leu Asp Glu Leu 
15 

Tyr His Leu Lys 
30 

Lys Arg He Ser 
45 

Arg Arg Val Thr 

Lys Ala Val Gin 
80 

Lys Val Lys Arg 
95 

Pro Asn Val Leu 
110 

Trp Val Thr Asp 
125 

Leu Ser Pro Val 

Leu Ser Glu Arg 
160 

Ala Phe Lys Lys 
175 

Gin Gly Trp Gin 
190 

His Gly He Lys 
205 

Ala Val Val Glu 

Leu Asp Glu Phe 
240 
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Ser Asn He Ser Glu Leu Lys Asp Ala Val Thr Glu Tyr He Glu Tyr 

245 250 255 

Tyr Asn Ser Arg Arg lie Ser Leu Lys Leu Lys Gly Leu Thr Pro He 

260 265 270 

Glu Tyr Arg Asn Gin Thr Tyr Met Pro Arg Val 
275 280 

<210> 311 
<211> 38 
<212> PRT 
<213> E. Coli 

<400> 311 

Met Lys Val Arg Ala Ser Val Lys Lys Leu Cys Arg Asn Cys Lys He 

15 10 15 

Val Lys Arg Asp Gly Val He Arg Val He Cys Ser Ala Glu Pro Lys 

20 25 30 

His Lys Gin Arg Gin Gly 
35 



<210> 312 
<211> 443 
<212> PRT 
<213> E. Coli 



<400> 312 
Met Ala Lys Gin 
1 

Gly Glu Leu Lys 
20 

Phe Arg He Gly 
35 

Leu Ala Lys Leu 

50 

Asn Met Phe Ser 
65 

Gly He Met Pro 

Val Val His Pro 
100 

Arg Arg Lys He 
115 

He Phe Gin Ser 
130 

Met Gin Gly Leu 
145 

Val Val Ser Leu 

Gin He Thr Glu 
180 

Ala Gly He Val 
195 

Gin Ala Arg Gin 
210 

Val Leu Val Phe 
225 



Pro Gly Leu Asp 
5 

Arg Arg Leu Leu 

Ser Phe He Pro 
40 

Leu Glu Gin Gin 
55 

Gly Gly Ala Leu 
70 

Tyr He Ser Ala 
85 

Thr Leu Ala Glu 

Ser Gin Tyr Thr 
120 

He Gly He Ala 
135 

Val He Asn Pro 
150 

Val Thr Gly Thr 
165 

Arg Gly He Gly 

Ala Gly Leu Pro 
200 

Gly Asp Leu His 
215 

Ala Val Thr Phe 
230 



Phe Gin Ser Ala 
10 

Phe Val He Gly 
25 

He Pro Gly He 

Arg Gly Thr He 
60 

Ser Arg Ala Ser 
75 

Ser He He He 
90 

He Lys Lys Glu 
105 

Arg Tyr Gly Thr 

Thr Gly Leu Pro 
140 

Gly Phe Ala Phe 
155 

Met Phe Leu Met 
170 

Asn Gly He Ser 
185 

Pro Ala He Ala 

Phe Leu Val Leu 
220 

Phe Val Val Phe 
235 



Lys Gly Gly Leu 
15 

Ala Leu He Val 
30 

Asp Ala Ala Val 
45 

He Glu Met Phe 

He Phe Ala Leu 
80 

Gin Leu Leu Thr 
95 

Gly Glu Ser Gly 
110 

Leu Val Leu Ala 
125 

Asn Met Pro Gly 

Tyr Phe Thr Ala 
160 

Trp Leu Gly Glu 
175 

He He He Phe 
190 

His Thr He Glu 
205 

Leu Leu Val Ala 

Val Glu Arg Gly 
240 
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Gin Arg Arg lie 

Val Tyr Ala Ala 
260 

Gly Val He Pro 
275 

Thr He Ala Ser 
290 

Thr He Ser Leu 
305 

Tyr Ala Ser Ala 

Phe Asn Pro Arg 
340 

Val Pro Gly He 
355 

Val Met Thr Arg 
370 

Cys Leu He Pro 
385 

Phe Gly Gly Thr 

Met Ala Gin Val 
420 

Leu Lys Lys Ala 
435 



Val Val Asn Tyr 
245 

Gin Ser Thr His 

Ala He Phe Ala 
280 

Trp Phe Gly Gly 
295 

Tyr Leu Gin Pro 
310 

He He Phe Phe 
325 

Glu Thr Ala Asp 

Arg Pro Gly Glu 
360 

Leu Thr Leu Val 
375 

Glu Phe Met Arg 
390 

Ser Leu Leu He 
405 

Gin Thr Leu Met 

Asn Leu Lys Gly 
440 



Ala Lys Arg Gin 
250 

Leu Pro Leu Lys 
265 

Ser Ser He He 

Gly Thr Gly Trp 
300 

Gly Gin Pro Leu 
315 

Cys Phe Phe Tyr 
330 

Asn Leu Lys Lys 
345 

Gin Thr Ala Lys 

Gly Ala Leu Tyr 
380 

Asp Ala Met Lys 
395 

Val Val Val Val 
410 

Met Ser Ser Gin 
425 

Tyr Gly Arg 



Gin Gly Arg Arg 
255 

Val Asn Met Ala 
270 

Leu Phe Pro Ala 
285 

Asn Trp Leu Thr 

Tyr Val Leu Leu 
320 

Thr Ala Leu Val 
335 

Ser Gly Ala Phe 
350 

Tyr He Asp Lys 
365 

He Thr Phe He 

Val Pro Phe Tyr 
400 

He Met Asp Phe 
415 

Tyr Glu Ser Ala 
430 



<210> 313 
<211> 144 
<212> PRT 
<213> E. Coll 



<400> 
Met Arg Leu 
1 

Lys Arg Leu 

Arg Gly His 
35 

Gly Phe Glu 
50 

Gly Phe Thr 
65 

Asp Leu Ala 

Ala Ala Asn 

Ala Gly Glu 
115 

Lys Gly Ala 
130 



313 

Asn Thr Leu 
5 

Gly Arg Gly 
20 

Lys Gly Gin 

Gly Gly Gin 

Ser Arg Lys 
70 

Lys Val Glu 
85 

He He Gly 
100 

Val Thr Thr 
Arg Ala Ala 



Ser Pro Ala 

He Gly Ser 
25 

Lys Ser Arg 
40 

Met Pro Leu 
55 

Ala Ala He 

Gly Gly Val 

He Gin He 
105 

Pro Val Thr 

120 
He Glu Ala 
135 



Glu Gly Ser 
10 

Gly Leu Gly 

Ser Gly Gly 

Tyr Arg Arg 
60 

Thr Ala Glu 
75 

Val Asp Leu 
90 

Glu Phe Ala 

Val Arg Gly 

Ala Gly Gly 
140 



Lys Lys 

Lys Thr 

30 
Gly Val 
45 

Leu Pro 

He Arg 

Asn Thr 

Lys Val 
110 
Leu Arg 
125 

Lys He 



Ala Gly 
15 

Gly Gly 

Arg Arg 

Lys Phe 

Leu Ser 

80 
Leu Lys 
95 

He Leu 
Val Thr 
Glu Glu 



<210> 314 
<211> 59 
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<212> PRT 
<213> E. Coli 



<400> 314 
Met Ala Lys Thr lie Lys lie Thr 

1 5 
Leu Pro Lys His Lys Ala Thr Leu 
20 

Gly His Thr Val Glu Arg Glu Asp 

35 40 
Asn Ala Val Ser Phe Met Val Lys 
50 55 



Gin Thr Arg Ser Ala He Gly Arg 

10 15 
Leu Gly Leu Gly Leu Arg Arg He 
25 30 
Thr Pro Ala He Arg Gly Met He 
45 

Val Glu Glu 



<210> 315 
<211> 167 
<212> PRT 
<213> E. Coli 





<400> 


315 






















Met 


Ala 


His 


He 


Glu 


Lys 


Gin 


Ala 


Gly 


Glu 


Leu 


Gin Glu Lys 


Leu 


He 


1 








5 










10 




15 




Ala 


Val 


Asn 


Arg 


Val 


Ser 


Lys 


Thr 


Val 


Lys 


Gly Gly Arg He 


Phe 


Ser 








20 










25 






30 






Phe 


Thr 


Ala 


Leu 


Thr 


Val 


Val 


Gly Asp 


Gly Asn 


Gly Arg Val 


Gly 


Phe 






35 










40 








45 




Gly 


Tyr 
50 


Gly 


Lys 


Ala 


Arg 


Glu 
55 


Val 


Pro 


Ala 


Ala 


He Gin Lys 
60 


Ala 


Met 


Glu 


Lys 


Ala 


Arg 


Arg Asn 


Met 


He 


Asn 


Val 


Ala 


Leu Asn Asn 


Gly 


Thr 


65 










70 










75 




80 


Leu 


Gin 


His 


Pro 


Val 


Lys 


Gly Val 


His 


Thr 


Gly 


Ser Arg Val 


Phe 


Met 










85 










90 






95 




Gin 


Pro 


Ala 


Ser 


Glu 


Gly 


Thr 


Gly 


He 


He 


Ala 


Gly Gly Ala Met Arq 








100 










105 






110 






Ala 


Val 


Leu 


Glu 


Val 


Ala 


Gly Val 


His 


Asn 


Val 


Leu Ala Lys 


Ala 


Tyr 






115 










120 








125 




Gly 


Ser 


Thr 


Asn 


Pro 


He 


Asn 


Val 


Val 


Arg 


Ala 


Thr He Asp 


Gly 


Leu 




130 










135 










140 




Glu 


Asn 


Met 


Asn 


Ser 


Pro 


Glu 


Met 


Val 


Ala 


Ala 


Lys Arg Gly Lys 


Ser 


145 










150 










155 






160 


Val 


Glu 


Glu 


He 


Leu 
165 


Gly 


Lys 















<210> 316 
<211> 117 
<212> PRT 
<213> E. Coli 

<400> 316 
Met Asp Lys Lys Ser Ala Arg He 

1 5 
Lys Leu Gin Glu Leu Gly Ala Thr 
20 

Arg His He Tyr Ala Gin Val He 
35 40 



Arg Arg Ala Thr Arg Ala Arg Arg 

10 15 
Arg Leu Val Val His Arg Thr Pro 
25 30 
Ala Pro Asn Gly Ser Glu Val Leu 
45 
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Val Ala Ala 
50 

Thr Gly Asn 
65 

Arg Ala Leu 

Phe Gin Tyr 

Ala Gly Leu 
115 



Ser Thr Val Glu Lys Ala He Ala Glu Gin Leu Lys Tyr 

55 60 
Lys Asp Ala Ala Ala Ala Val Gly Lys Ala Val Ala Glu 

70 75 80 

Glu Lys Gly He Lys Asp Val Ser Phe Asp Arg Ser Gly 

85 90 95 

His Gly Arg Val Gin Ala Leu Ala Asp Ala Ala Arg Glu 
100 105 110 

Gin Phe 



<210> 
<211> 
<212> 
<213> 



317 
177 
PRT 

E. Coli 



<400> 
Met Ser Arg 
1 

Val Lys He 

Leu Thr Arg 
35 

Thr Leu Thr 
50 

Ala Gly Thr 
65 

Glu Gly Phe 

Ala Val Lys 

Val Asp His 
115 

Thr Glu He 
130 

Ala Ala Asp 
145 

Gly Val Arg 



317 

Val Ala Lys 
5 

Asn Gly Gin 
20 

Thr Leu Asn 

Phe Gly Pro 

Ala Arg Ala 
70 

Thr Lys Lys 
85 

Gly Asn Val 
100 

Gin Leu Pro 

Val Leu Lys 

Leu Arg Ala 
150 

Tyr Ala Asp 
165 



Ala Pro 

Val He 

Asp Ala 
40 

Arg Asp 
55 

Leu Leu 

Leu Gin 

He Asn 

Ala Gly 
120 
Gly Ala 
135 

Tyr Arg 
Glu Val 



Val Val 
10 

Thr He 
25 

Val Glu 

Gly Tyr 

Asn Ser 

Leu Val 
90 

Leu Ser 
105 

He Thr 

Asp Lys 

Arg Pro 

Val Arg 
170 



Val Pro Ala Gly 
Lys Gly 
Val Lys 



Ala Asp 
60 

Met Val 
75 

Gly Val 

Leu Gly 

Ala Glu 

Gin Val 
140 
Glu Pro 
155 

Thr Lys 



Lys Asn 

30 
His Ala 
45 

Gly Trp 



He Gly 

Gly Tyr 

Phe Ser 
110 
Cys Pro 
125 

He Gly 
Tyr Lys 
Glu Ala 



Val Asp 
15 

Gly Glu 

Asp Asn 

Ala Gin 

Val Thr 
80 

Arg Ala 
95 

His Pro 

Thr Gin 

Gin Val 

Gly Lys 
160 
Lys Lys 
175 



<210> 318 
<211> 130 
<212> PRT 
<213> E. Coli 

<400> 318 
Met Ser Met Gin Asp Pro He Ala 

1 5 
Gly Gin Ala Ala Asn Lys Ala Ala 
20 

Lys Val Ala He Ala Asn Val Leu 

35 40 
Phe Lys Val Glu Gly Asp Thr Lys 



Asp Met Leu Thr Arg He Arg Asn 

10 15 
Val Thr Met Pro Ser Ser Lys Leu 
25 30 
Lys Glu Glu Gly Phe He Glu Asp 
45 

Pro Glu Leu Glu Leu Thr Leu Lys 



-157- 



50 55 60 

Tyr Phe Gin Gly Lys Ala Val Val Glu Ser He Gin Arg Val Ser Arg 
65 70 75 80 

Pro Gly Leu Arg He Tyr Lys Arg Lys Asp Glu Leu Pro Lys Val Met 
85 90 95 

. Ala Gly Leu Gly He Ala Val Val Ser Thr Ser Lys Gly Val Met Thr 
100 105 110 

Asp Arg Ala Ala Arg Gin Ala Gly Leu Gly Gly Glu He He Cys Tyr 
115 120 125 

Val Ala 
130 



<210> 319 
<211> 101 
<212> PRT 
<213> E. Coli 





<400> 


319 










Met 


Ala 


Lys 


Gin 


Ser 


Met 


Lys 


Ala 


1 








5 








Ala 


Asp 


Lys 


Tyr 
20 


Phe 


Ala 


Lys 


Arg 


Asp 


Val 


Asn 
35 


Ala 


Ser 


Asp 


Glu 


Asp 
40 


Gin 


Thr 


Leu 


Pro 


Arg Asp 


Ser 


Ser 




50 










55 




Arg 


Gin 


Thr 


Gly 


Arg 


Pro 


His 


Gly 


65 










70 






Arg 


He 


Lys 


Val 


Arg 
85 


Glu 


Ala 


Ala 


Lys 


Lys 


Ala 


Ser 
100 


Trp 









Arg Glu Val Lys Arg Val Ala Leu 

10 15 
Ala Glu Leu Lys Ala He He Ser 
25 30 
Arg Trp Asn Ala Val Leu Lys Leu 
45 

Pro Ser Arg Gin Arg Asn Arg Cys 
60 

Phe Leu Arg Lys Phe Gly Leu Ser 

75 80 
Met Arg Gly Glu He Pro Gly Leu 
90 95 



<210> 320 
<211> 179 
<212> PRT 
<213> E. Coli 





<400> 


320 










Met 


Ala 


Lys 


Leu 


His 


Asp 


Tyr 


Tyr 


1 








5 








Met 


Thr 


Glu 


Phe 


Asn 


Tyr 


Asn 


Ser 








20 










Lys 


He 


Thr 


Leu 


Asn 


Met 


Gly Val 






35 










40 


Leu 


Leu 


Asp Asn 


Ala 


Ala 


Ala 


Asp 




50 










55 




Pro 


Leu 


He 


Thr 


Lys 


Ala 


Arg 


Lys 


65 










70 






Gin 


Gly 


Tyr 


Pro 


He 
85 


Gly 


Cys 


Lys 


Trp 


Glu 


Phe 


Phe 


Glu 


Arg 


Leu 


He 








100 










Asp 


Phe 


Arg 


Gly 


Leu 


Ser 


Ala 


Lys 



Lys Asp Glu Val Val Lys Lys Leu 

10 15 
Val Met Gin Val Pro Arg Val Glu 
25 30 
Gly Glu Ala He Ala Asp Lys Lys 
45 

Leu Ala Ala He Ser Gly Gin Lys 
60 

Ser Val Ala Gly Phe Lys He Arg 

75 80 
Val Thr Leu Arg Gly Glu Arg Met 

90 95 
Thr He Ala Val Pro Arg He Arg 
105 110 
Ser Phe Asp Gly Arg Gly Asn Tyr 



-158- 



115 120 125 



Ser 


Met 


Gly 


Val 


Arg 


Glu 


Gin 


He 


He 


Phe 


Pro 


Glu 


He 


Asp 


Tyr 


Asp 




130 










135 










IAD 










Lys 


Val 


Asp 


Arg 


Val 


Arg 


Gly Leu Asp 


He 


Thr 


He 


Thr 


Thr 


Thr 


Ala 


145 










150 










155 










160 


Lys 


Ser 


Asp 


Glu 


Glu 


Gly Arg Ala 


Leu 


Leu 


Ala 


Ala 


Phe 


Asp 


Phe 


Pro 










165 










170 










175 




Phe 


Arg 


Lys 






























<210> 


321Z 
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Ala Ala 
630 

Leu Thr 

Trp Val 

Arg Glu 

Trp Glu 
695 
Cys Pro 
710 

Gin Phe 

Arg Thr 

Ser Gly 

Tyr Gin 
775 
Ala Arg 
790 

Val Arg 

Thr Glu 

Lys His 

Asp He 
855 
Lys Pro 
870 

Leu Ala 
Arg Ala 
Glu Leu 
Leu Val 



Leu Asp 
505 
Asp Ala 
520 

Val Thr 



Leu Lys 

Gin Leu 

Thr Leu 
585 
Gin Tyr 
600 

Ala Leu 

Asp He 

Trp Leu 

Gin Met 
665 
Gly Arg 
680 

Leu Gly 

Met Asp 

Tyr Leu 

Ser Ser 
745 
Ala Ala 
760 

Lys Asn 

Val Val 

Arg Glu 

Val Arg 
825 
Arg Asp 
840 

Glu Phe 

Lys Leu 

Gin Asn 

Tyr Thr 
905 
Pro Gly 
920 

Arg Ala 



Glu 


Leu 


Asp 


Lys 


Arg 


490 










His 


Leu 


Met 


Pro 


His 










510 


Ala 


Val 


Thr 


Leu 


Ser 








525 




Arg 


Thr 


Thr 


Tvr 


Leu 






540 






His 


Leu 


He 


Ser 


Leu 




555 








Ala 


Arg 


Tvr 


Pro 


Leu 


570 










Tvr 


Gin 


Pro 


Thr 


Ala 










O w 


Leu 


Leu 


Arg 


V Cl_L 














Arg 


Gin 


Phe 


Lys 


Gin 






620 






Ala 


Gly 


Thr 


i_JC- u. 






635 








Ala 


Glu 


Ala 


Met 


X X c 


650 










Val 


Ala 


Arg 


Tvr 


Gly 










D / U 


Gly 


Phe 


Ala 


V Ct-L 


V d X 








O 0 O 




Tvr 


Ser 


Ser 


Asp 


Leu 






700 






Ala 


Met 


± iil. 




ijxy 




715 








Arg 


Leu 


Ala 


Gin 


Arrr 
iT.x y 


730 










Gly 


He 


Leu 


Tvr 


f^l n 

0_L U. 










7 R 0 


Gly 


J. it; L 


ijeu 


\7:=i 1 
V <dX 


i nr 








/ DO 




Glu 


Ala 


Trn 


Thr 


Trn 

X xp 






780 






Tvr 


Gly 


Asp 


Pro 


Gin 




795 








lie 


Met 


Thr 


Leu 


Prn 


810 










Glu 


Met 


Arg 


Glu 


Lys 










R n 
o o u 


Arg 


Phe 




-L _L t; 


Lys 








ft 4 R 
O -J 




He 


Thr 


Gin 


Tvr 

J. _y j_ 


Leu 






860 






Thr 


Arg 


Trn 


Ser 


Asp 




875 








Asp 


He 


Met 


Glu 


Glu 


890 










Thr 


Leu 


Arg 


Asp 


Glu 










910 


His 


Val 


Ser 


Glu 


Asp 








925 




Ser 


Trp 


Gin 


Lys 


Trp 



Thr He 
495 

Leu Leu 

Arg He 

Glu Leu 

Cys Ala 
560 
Leu Leu 
575 

Thr Asp 

Glu Asp 

Ala Gin 

Val Met 
640 
Asp Ala 
655 

Lys Pro 

Gly Tyr 

Asp Leu 

Glu Arg 
720 
He Met 
735 

Val Asp 

Ser Ala 

Glu His 

Leu Thr 
800 
Arg Glu 
815 

Met Arg 

Ala Asp 

Val Leu 

Asn Val 
880 
Gin Glu 
895 

Leu His 
Cys Phe 
Leu Val 
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930 935 940 

Glu Glu 
945 



<210> 327 
<211> 433 
<212> PRT 
<213> Coli 



<400> 327 



Met 


Ala 


Gin 


Glu 


He 


Glu 


Leu 


Lys 


I: lit; 


He 


Val 


Asn 


His 


Ser 


Ala 


val 


1 








5 










10 










15 




Glu 


Ala 


Leu Arg Asp 


His 


Leu 


Asn 


± ilJL 


Leu 


Gly Gly 




His 


His 


Asp 








90 
z u 










9 R 










30 




Pro 


Val 


ta-Ln 


Leu 


Leu 


Asn 


He 


1 yx 


lyr 


Glu 


Thr 


Pro 


Asp 


Asn 


Trp 


Leu 
































Arg 


Gly 


nlS 


Asp 


Met 


Gly 


Leu 




He 


Arg Gly Glu 


Asn 


Gly Arg 


Tyr 




50 










55 










60 








Glu 


Met 


1 nr 


iYier 


Lys 


Val 


Ala 


Gly 


Arg 


Val 


Thr 


Gly 


Liiy 


Leu 


His 


Gin 


65 










70 










75 








80 


Arg 


Pro 




lyr 


Asn 

o O 


Val 


Ala 


Leu 


Ser 


Glu 
90 


Pro 


Thr 


Leu 


Asp 


Leu 
95 


Ala 


Gin 


Leu 


rro 


Thr 


blU 


Val 


Trp 


P-ro 
n J, (J 


Asn 


Gly 


Glu 


Leu 


Pro 


Ala 


Asp 


Leu 








± u u 










105 










110 




Ala 


Ser 


Arg 


val 


bin 


Pro 


Leu 


IT lit: 


Ser 


Thr 


Asp 


Phe 


Tyr 


Arg 


Glu 


Lys 






lis 










-L ^ W 










125 




Trp 


Leu 


V a_L 


t\±cL 


V ai 


Asp 


Gly 




Gin 


He 


Glu 


He 


Ala 


Leu 


Asp 


Gin 




130 










135 










140 








Gly 


Glu 


Val 






Gly 


Glu 


Phe 


Ala 


Glu 


Pro 


He 


Cys 


Glu 


Leu 


Glu 


145 










150 










155 








160 


Leu 


Glu 


Leu 


Leu 


O t: J_ 


Gly Asp 


Thr 


Arg 


Ala 


Val 


Leu 


Lys 


Leu 


Ala 


Asn 










165 










170 










175 




Gin 


Leu 


Val 


Ser 


r^i n 

Vji_L 11 


Thr 


Gly 


Leu 


Arg 


Gin 


Gly 


Ser 


Leu 


Ser 


Lys 


Ala 








180 










185 










190 




Ala 


Arg 


Gly 


Tyr 


His 


Leu 


Ala 


Gin 


Gly Asn 


Pro 


Ala 


Arg 


Glu 


He 


Lys 






195 










200 










205 






Pro 


Thr 


Thr 


He 


Leu 


His 


Val 


Ala 


Ala 


Lys 


Ala 


Asp 


Val 


Glu 


Gin 


Glv 




210 










215 










220 








Leu 


Glu 


Ala 


Ala 


Leu 


Glu 


Leu 


Ala 


Leu 


Ala 


Gin 


Trp 


Gin 


Tyr 


His 


Glu 


225 










230 










235 








240 


Glu 


Leu 


Trp 


Val 


Arg 


Gly Asn 


Asp 


Ala 


Ala 


Lys 


Glu 


Gin 


Val 


Leu 


Ala 










245 










250 










255 




Ala 


lie 


Ser 


Leu 


Val 


Arg 


His 


Thr 


Leu 


Met 


Leu 


Phe 


Gly Gly 


He 


Val 








260 










265 










270 






Pro 


Arg 


Lys 


Ala 


Ser 


Thr 


His 


Leu 


Arg 


Asp 


Leu 


Leu 


Thr 


Gin 


Cys 


Glu 






275 










280 










285 






Ala 


Thr 


He 


Ala 


Ser 


Ala 


Val 


Ser 


Ala 


Val 


Thr 


Ala 


Val 


Tyr 


Ser 


Thr 




290 










295 










300 








Glu 


Thr 


Ala 


Met 


Ala 


Lys 


Leu 


Ala 


Leu 


Thr 


Glu 


Trp 


Leu 


Val 


Ser 


Lys 


305 










310 










315 










320 


Ala 


Trp 


Gin 


Pro 


Phe 


Leu 


Asp Ala 


Lys 


Ala 


Gin 


Gly 


Lys 


He 


Ser 


Asp 










325 










330 










335 


Ser 


Phe 


Lys 


Arg 
340 


Phe 


Ala 


Asp 


He 


His 
345 


Leu 


Ser 


Arg 


His 


Ala 
350 


Ala 


Glu 


Leu 


Lys 


Ser 


Val 


Phe 


Cys 


Gin 


Pro 


Leu 


Gly Asp Arg 


Tyr Arg Asp 


Gin 
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355 360 365 

Leu Pro Arg Leu Thr Arg Asp He Asp Ser He Leu Leu Leu Ala Gly 

370 375 380 

Tyr Tyr Asp Pro Val Val Ala Gin Ala Trp Leu Glu Asn Trp Gin Gly 
385 390 395 400 

Leu His His Ala He Ala Thr Gly Gin Arg He Glu He Glu His Phe 

405 410 415 

Arg Asn Glu Ala Asn Asn Gin Glu Pro Phe Trp Leu His Ser Gly Lys 
420 425 430 

Arg 



<210> 328 
<211> 70 
<212> PRT 
<213> E. Coli 



<400> 328 

Met Ser Gly Lys Met Thr Gly He Val Lys Trp Phe Asn Ala Asp Lys 

15 10 15 

Gly Phe Gly Phe He Thr Pro Asp Asp Gly Ser Lys Asp Val Phe Val 

20 25 30 

His Phe Ser Ala He Gin Asn Asp Gly Tyr Lys Ser Leu Asp Glu Gly 

35 40 45 

Gin Lys Val Ser Phe Thr He Glu Ser Gly Ala Lys Gly Pro Ala Ala 

50 55 60 

Gly Asn Val Thr Ser Leu 
65 70 



<210> 329 
<211> 523 
<212> PRT 
<213> E. Coli 



<400> 329 
Met Arg Asp He 
1 

Trp Asp Glu Leu 
20 

Val Asp Arg His 
35 

Ser Gin Ala Glu 
50 

Pro Ala Lys He 
65 

Lys Leu Phe Ser 

Arg Asp Leu Gin 
100 

Val Cys Ala Asn 
115 

Ser Trp Leu Asp 
130 



Val Asp Pro Val 
5 

Arg His Met Pro 

Glu Asp Ala He 
40 

Thr Ala His Val 
55 

Phe Gin Leu Asp 
70 

Met Leu Asn His 
85 

Cys Ser He Asp 

Asn Ala Trp Gin 
120 

Lys Met Asn Lys 
135 



Phe Ser He Gly 
10 

Ala Gly Gly Val 
25 

Ser Leu Ala Asn 

Ala Val He Ser 
60 

Asp Ser Gin Gly 
75 

Glu Lys Gly Leu 
90 

Pro His Asn Tyr 
105 

Asn He Pro Ala 

Trp Ser Arg Leu 
140 



He Ser Ser Leu 
15 

Trp Trp Phe Asn 
30 

Gin Thr He Ala 
45 

Met Asp Ser Asp 

Pro Glu Lys He 
80 

Tyr Tyr Leu Thr 
95 

Leu Phe He Leu 
110 

Glu Arg Leu Arg 
125 

Asn His Cys Ser 
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Leu 


Leu 


Val 


He 


Asn 


Pro 


Gly Asn Asn 


Asn 


Asp 


Lys 


Gin 


Phe 


Ser 


iJ C Lt 


145 










150 










155 












Leu 


Leu 


Glu 


Glu 


Tyr 


Arg 


Ser 


Leu 


Phe 


Gly Leu Ala 


Ser 


Leu 


Arg 


Phe 










165 










170 










175 




Gin 


Gly Asp 


Gin 


His 


Leu 


Leu 


Asp 


He 


Ala 


Phe 


1 rp 


Cys 


Asn 


Glu 


ijy s 








180 










185 










190 




Gly Val 


Ser 


Ala 


Arg 


Gin 


Gin 


Leu 


Ser 


Vp 1 
V dx 




k3 -Lii 


Gin 


Asn 


Gly 


He 






195 










200 










205 






Trp 


Thr 


Leu 


Val 


Gin 


Ser 


Glu 


Glu 


Ala 


o J.U 


ire 


Ljin 


Pro 


Arg 


Ser 


Asp 




210 










215 










220 






Glu 


Lys 


Arg 


He 


Leu 


Ser 


Asn 


Val 




vax 


Leu 


Pin 

blU 


Gly Ala 


Pro 


Pro 


225 










230 










9 ^ R 










240 


Leu 


Ser 


Glu 


His 


Trp 
245 


Gin 


Leu 


Phe 




250 




r 1 n 
olU 


Val 


Leu 


Phe 
255 


Asn 


Glu 


Ala 


Arg 


Thr 
260 


Ala 


Gin 


Ala 


Ala 


Thr 
265 


V d.X 


V ai 


Jrne 


Ser 


Leu 

270 


Gin 


Gin 


Asn 


Ala 


Gin 


He 


Glu 


Pro 


Leu 


Ala 


Arg 


C Q V~ 

oer 


lie 


nlS 


Thr 


Leu 


Arg Arg 






275 










280 










285 








Gin 


Arg 
290 


Gly 


Ser 


Ala 


Met 


Lys 
295 






va± 


Arg 


Glu 

o r\ r\ 


Asn 


Thr 


Ala 


Ser 


Leu Arg Ala 


Thr 


Asp 


Glu 




Leu 


Leu 


Leu 


Ala 


Cys 


Gly Ala Asn 


Met 


305 










310 










olo 










320 


Val 


He 


Pro 


Trp 


Asn 
325 


Ala 


Jr X O 


Leu 


oer 


Arg 
330 


Cys 


Leu 


Thr 


Met 


He 
335 


Glu 


Ser 


Val 


Gin 


Gly Gin 


Lys 


Phe 


Ser 


Arg 


Tyr 


val 


Pro 


Glu 


Asp 


He 


Thr 








340 










345 










350 






Thr 


Leu 


Leu 


Ser 


Met 


Thr 


Gin 


Pro 


Leu 


Lys 


Leu 


Arg 


Gly 


Phe 


Gin 


Lys 






355 










360 










365 






Trp Asp Val 


Phe 


Cys 


Asn 


Ala 


Val 


Asn 


Asn 


ixiet 


Met 


Asn 


Asn 


Pro 


Leu 




370 










375 










T Q ri 
o o U 










Leu 


Pro 


Ala 


His 


Gly 


Lys 


Gly 


Val 


Leu 


val 


Ala 


Leu 


Arg 


Pro 


Val 


Pro 


385 










390 


















400 


Gly 


He 


Arg 


Val 


Glu 
405 


Gin 


Ala 


Leu 


Thr 


Leu 

AID 
^ J. u 


Cys 


Arg 


Pro 


Asn 


Arg 
415 


Thr 


Gly Asp 


He 


Met 


Thr 


He 


Gly Gly Asn 


Arg 


Leu 


val 


Leu 


Phe 


Leu 


Ser 








420 










425 










430 






Phe 


Cys 


Arg 
435 


He 


Asn 


Asp 


Leu 


Asp 
440 


Thr 


Ala 


Leu 


Asn 


His 
445 


He 


Phe 


Pro 


Leu 


Pro 


Thr 


Gly Asp 


He 


Phe 


Ser 


Asn 


Arg 


Met 


Val 


Trp 


Phe 


Glu 


Asp 




450 










455 










460 








Asp 


Gin 


He 


Ser 


Ala 


Glu 


Leu 


Val 


Gin 


Met 


Arg 


Leu 


Leu 


Ala 


Pro 


Glu 


465 










470 










475 










480 


Gin 


Trp 


Gly Met 


Pro 


Leu 


Pro 


Leu 


Thr 


Gin 


Ser 


Ser 


Lys 


Pro 


Val 


He 










485 










490 










495 




Asn 


Ala 


Glu 


His 


Asp 


Gly Arg 


His 


Trp 


Arg Arg 


He 


Pro 


Glu 


Pro 


Met 








500 










505 










510 






Arg 


Leu 


Leu Asp Asp 


Ala 


Val 


Glu 


Arg 


Ser 


Ser 













515 520 



<210> 330 
<211> 62 
<212> PRT 
<213> E. Coli 

<400> 330 

Met Thr He Ser Asp He He Glu He He Val Val Cys Ala Leu He 
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1 

Phe Phe Pro Leu 
20 

Asp Thr Leu Arg 
35 

Gly Thr Leu Arg 
50 



5 

Gly Tyr Leu Ala 

Leu Phe Phe Ala 
40 

Arg Thr Glu Lys 
55 



10 

Arg His Ser Leu 
25 

Lys Pro Arg Tyr 

Ala Arg Ala Thr 
60 



15 

Arg Arg lie Arg 
30 

Val Lys Pro Ala 
45 

Lys Lys 



<210> 331 
<211> 559 
<212> PRT 
<213> E, Coll 

<400> 331 

Met Thr Gin Phe Thr Gin Asn Thr Ala Met Pro Ser Ser Leu Trp Gin 

15 10 15 

Tyr Trp Arg Gly Leu Ser Gly Trp Asn Phe Tyr Phe Leu Val Lys Phe 

20 25 30 

Gly Leu Leu Trp Ala Gly Tyr Leu Asn Phe His Pro Leu Leu Asn Leu 

35 40 45 

Val Phe Ala Ala Phe Leu Leu Met Pro Leu Pro Arg Tyr Ser Leu His 

50 55 60 

Arg Leu Arg His Trp He Ala Leu Pro He Gly Phe Ala Leu Phe Trp 
65 70 75 80 

His Asp Thr Trp Leu Pro Gly Pro Glu Ser lie Met Ser Gin Gly Ser 

85 90 95 

Gin Val Ala Gly Phe Ser Thr Asp Tyr Leu He Asp Leu Val Thr Arg 

100 105 110 

Phe He Asn Trp Gin Met He Gly Ala He Phe Val Leu Leu Val Ala 

115 120 125 

Trp Leu Phe Leu Ser Gin Trp He Arg He Thr Val Phe Val Val Ala 

130 135 140 

He Leu Leu Trp Leu Asn Val Leu Thr Leu Ala Gly Pro Ser Phe Ser 
145 150 155 160 

Leu Trp Pro Ala Gly Gin Pro Thr Thr Thr Val Thr Thr Thr Gly Gly 

165 170 175 

Asn Ala Ala Ala Thr Val Ala Ala Thr Gly Gly Ala Pro Val Val Gly 

180 185 190 

Asp Met Pro Ala Gin Thr Ala Pro Pro Thr Thr Ala Asn Leu Asn Ala 

195 200 205 

Trp Leu Asn Asn Phe Tyr Asn Ala Glu Ala Lys Arg Lys Ser Thr Phe 

210 215 220 

Pro Ser Ser Leu Pro Ala Asp Ala Gin Pro Phe Glu Leu Leu Val He 
225 230 235 240 

Asn He Cys Ser Leu Ser Trp Ser Asp He Glu Ala Ala Gly Leu Met 

245 250 255 

Ser His Pro Leu Trp Ser His Phe Asp He Glu Phe Lys Asn Phe Asn 

260 265 270 

Ser Ala Thr Ser Tyr Ser Gly Pro Ala Ala He Arg Leu Leu Arg Ala 

275 280 285 

Ser Cys Gly Gin Thr Ser His Thr Asn Leu Tyr Gin Pro Ala Asn Asn 

290 295 300 

Asp Cys Tyr Leu Phe Asp Asn Leu Ser Lys Leu Gly Phe Thr Gin His 
305 310 315 320 

Leu Met Met Gly His Asn Gly Gin Phe Gly Gly Phe Leu Lys Glu Val 
325 330 335 
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Arg 


Glu 


Asn 


Gly 


Gly Met 


Gin 


Ser 








340 










Pro 


Val 


He 


Leu 


Leu Gly 


Phe 


Asp 






355 










360 


Ala 


Val 


Leu 


Asn 


Arg 


Trp 


Leu 


Asp 




370 










375 




Arg 


Ser 


Ala 


Thr 


Phe 


Tyr 


Asn 


Thr 


385 










390 






Tyr 


Pro 


Gly 


Val 


Ser 


Lys 


Thr 


Ala 










405 








Phe 


Phe 


Asp 


Glu 


Leu Asp 


Ala 


Phe 








420 










Arg 


Lys 


Val 


Met 


Val 


Val 


Val 


Val 






435 










440 


Gly 


Asp 


Arg 


Met 


Pin 


V ax 


Ser 


Gly 




450 










455 




He 


Thr 


Asp 


Val 


Pro 


Val 


Glv 


Val 


465 










470 






His 


Gin 


Glv 


Ala 


Pro 


ire 


Val 


He 










485 








He 


Ser 


Asp 


Leu 


Val 


V a j- 


Arg 


Val 








500 










Asp 


Asn 


Val 


Asp 


Trp 


Lys 


Lys 


Leu 






515 










520 


Pro 


Val 


Ser 


Glu 


Asn 






rl_L d 




530 










J _) J 




Pro 


Tyr 


Val 


Arg 


Leu 








545 


















<210> 


332 












<211> 


127 












<212> 


PRT 












<213> 


E. Coll 










<400> 


332 










Met 


Glu 


Gly 


Ser 


Arg 


Met 


Lys 


Tvr 


1 








5 








Phe 


Ala 


Leu 


Ser 


Ala 


Gly 


Ser 


Tvr 








20 










Glu 


Gin 


Asn 


He 


Leu 


Lys 


Glu 


He 






35 










40 


Gin 


Asn 


Arg 


He 


Asp 


Gly 


Leu 


Asn 




50 










55 




Asn 


Cys 


Ser 


Asp 


Ser 


Gin 


Leu 


Arg 


65 










70 






Lys 


Gin 


Lys 


Asp 


Glu 
85 


Val 


Ala 


Glu 


Lys 


Gin 


Lys 


Gly 


Asp 


Ala 


Asp 


Lys 








100 










Ala 


Glu 


Ala 


Gin 


Glu 


Glu 


Leu 


Lys 






115 










120 



Glu 


Leu 


Met 


Asp 


Gin 


Thr 


Asn 


Leu 


345 










350 






Gly 

J: 


Ser 


Pro 


Val 


Tyr Asp 


Asp 


Thr 










365 








Val 


Thr 


Glu 


Lys 


Asp 


Lys 


Asn 


Ser 








380 










Leu 


Pro 


Leu 


His 


Asp 


Gly 


Asn 


His 






395 










4 00 


Asp 


Tyr 


Lys 


Ala 


Arg Ala 


Gin 


Lys 




410 










415 




Phe 


Thr 


Glu 


Leu 


Glu 


Lys 


Ser 


Gly 


425 










430 






Pro 


Glu 


His 


Gly Gly Ala 


Leu 


Xiy s 










445 








Leu 


Arg Asp 


He 


Pro 


Ser 




O t;X 








460 










Lys 


Phe 


Phe 


Gly Met 


Lys 










475 










A R n 

4 o U 


Glu 


Gin 


Pro 


Ser 


Ser 


Phe 


Leu 






490 










4 Q R 




Leu 


Asp 


Gly 


Lys 


He 


Phe 


Thr 


Glu 


505 










510 






Thr 


Ser 


Gly 


Leu 


Pro 


Gin 


Thr 


Ala 










525 








Val 


Val 


He 


Gin 


Tyr 


Gin 


Asp 


Lys 








540 










Asp 


Trp 


Val 


Pro 


Tyr 


Pro 


Gin 





555 



Arg 


He 


Ala 


Leu 


Ala 


Val 


Ser 


Leu 




10 










15 




Ala 


Thr 


Thr 


Leu 


Cys 


Gin 


Glu 


Lys 


25 










30 






Ser 


Tyr 


Ala 


Glu 


Lys 


His 


Gin 


Asn 










45 








Lys 


Ala 


Leu 


Ser 


Glu 


Val 


Arg Ala 








60 










Ala 


Asp 


His 


Gin 


Lys 


Lys 


He 


Ala 






75 










80 


Arg 


Gin 


Gin Asp 


Leu 


Ala 


Glu 


Ala 




90 










95 




He 


Ala 


Lys 


Arg 


Glu 


Arg 


Lys 


Leu 


105 










110 






Lys 


Leu 


Glu 


Ala 


Arg Asp 


Tyr 





125 



<210> 333 
<211> 101 
<212> PRT 



<213> E. Coli 





<400> 


333 




















Met 


Ser 


Lys 


Glu 


His 


Thr 


Thr Glu 


His 


Leu Arg Ala 


Glu 


Leu 


Lys 


Ser 


1 








5 








10 






15 




Leu 


Ser 


Asp 


Thr 


Leu 


Glu 


Glu Val 


Leu 


Ser Ser Ser 


Gly Glu 


Lys 


Ser 








20 








25 






30 






Lys 


Glu 


Glu 


Leu 


Ser 


Lys 


He Arg 


Ser 


Lys Ala Glu 


Gin 


Ala 


Leu 


Lys 






35 








40 






45 






Gin 


Ser 


Arg 


Tyr 


Arg 


Leu 


Gly Glu 


Thr 


Gly Asp Ala 


He 


Ala 


Lys 


Gin 




50 










55 




60 








Thr Arg 


Val 


Ala 


Ala 


Ala 


Arg Ala 


Asp 


Glu Tyr Val 


Arg 


Glu 


Asn 


Pro 


65 










70 






75 








80 


Trp 


Thr 


Gly 


Val 


Gly 


He 


Gly Ala 


Ala 


He Gly Val 


Val 


Leu 


Gly Val 
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Glu Ser Asp Arg He Thr His Arg Thr Val Asn Gly Glu Pro Ala Glu 
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Gin Trp Gin Tyr Asp Gly His Gly Trp Leu Thr Asp He Ser His Leu 
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Ser Glu Gly His Arg Val Ala Val His Tyr Gly Tyr Asp Asp Lys Gly 
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Arg Leu Thr Gly Glu Cys Gin Thr Val Glu Asn Pro Glu Thr Gly Glu 
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Leu Leu Trp Gin His Glu Thr Lys His Ala Tyr Asn Glu Gin Gly Leu 
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Ala Asn Arg Val Thr Pro Asp Ser Leu Pro Pro Val Glu Trp Leu Thr 
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20 
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300 










Ala 


Arg 


Gly 


Thr 


Ala 


Gin 


Val 


Thr 


He 


Lys 


Gin 


Asn 


Gly 


Tyr 


Asp 


He 


305 










310 










315 










320 


Tvr 


Asn 


Ser 


Thr 


Val 


Pro 


Pro 


Gly 


Pro 


Phe 


Thr 


He 


Asn Asp 


He 


Tvr 










325 










330 










335 




Ala 


Ala 


Gly 
J. J 


Asn 


Ser 


Gly 


Asp 


Leu 


Gin 


Val 


Thr 


He 


Lys 


Glu 


Ala 


Asp 








340 










345 










350 






Gly 


Ser 


Thr 


Gin 


He 


Phe 


Thr 


Val 


Pro 


Tyr 


Ser 


Ser 


Val 


Pro 


Leu 


Leu 






355 










360 










365 








Gin 


Arg 


Glu 


Gly 


His 


Thr 


Arg 


Tyr 


Ser 


He 


Thr 


Ala 


Glv 


Glu 


Tvr 


Arg 




370 










375 










380 










Ser 


Gly 


Asn 


Ala 


Gin 


Gin 


Glu 


Lys 


Thr 


Arg 


Phe 


Phe 


Gin 


Ser 


Thr 


Leu 


385 










390 










395 










400 


Leu 


His 


Gly 


Leu 


Pro 


Ala 


Gly 

Jr 


Trp 


Thr 


He 


Tyr 


Glv 


Gly 


Thr 


Gin 


Leu 










405 










410 










415 




Ala 


Asp 


Arg 


Tyr 


Arg 


Ala 


Phe 


Asn 


Phe 


Gly 


He 


Glv 


Lys 


Asn 


Met 


Gly 








420 










425 










'i O U 






Ala 


Leu 


Gly 


Ala 


Leu 


Ser 


Val 


Asp 


Met 


Thr 


Gin 


Ala 


Asn 


Ser 


Thr 


Leu 






435 










440 










4 R 

'I 4 ^ 








Pro 


Asp 


Asp 


Ser 


Gin 


His 


Asp 


Gly 


Gin 


Ser 


Val 


Arg 




Leu 


Tvr 

± y j_ 


Asn 




450 










455 










460 










Lvs 


Ser 


Leu 


Asn 


Glu 


Ser 


Glv 


Thr 


Asn 


He 


Gin 


Leu 


V ct _L 


o_L y 


Tvr 


Arg 


4 65 










470 










475 










4 fin 
^ o u 


Tvr 


Ser 


Thr 


Ser 


Gly 


TVT 


Phe 


Asn 


Phe 




Asp 


Thr 


1 XIX 


lyr 




riX y 










4 85 
























Met 


Asn 


Gly 


Tvr 


Asn 


Ile 


Glu 


Thr 


Gin 


Asp 


Gly Val 


He 


Gin 


Val 


juy o 








500 










505 










510 






Pro 


Lys 


Phe 


Thr 


Asp 


Tvr 


Tvr 


Asn 


Leu 


Ala 


Tyr 


Asn 


Lys 


Arg 


Gly 


Lys 






515 










520 










525 








Leu 


Gin 


Leu 


Thr 


Val 


Thr 


Gin 


Gin 


Leu 


Gly 


Arg 


Thr 


Ser 


Thr 


Leu 


Tvr 

X y X 




530 










535 










540 










Leu 


Ser 


Gly 


Ser 


His 


Gin 


Thr 


Tvr 


m 


Gly 


Thr 


Ser 


Asn 


Val 




VjjX LI 


545 










550 










555 












Gin 


Phe 


Gin 


Ala 


Gly 


Leu 


Asn 


Thr 


Ala 


Phe 


Glu 


Asp 


He 


Asn 


Trn 


Thr 










565 










570 










-J / .J 




Leu 


Ser 


Tyr 
J. jf j_ 


Ser 


Leu 


Thr 


Lvs 


Asn 


Ala 


Tro 


Gin 


Lys 


Gly Arg 


Asp 


Gin 








580 










585 










590 






Met 


Leu 


Ala 


Leu 


Asn 


Val 


Asn 


He 


Pro 


Phe 


Ser 


His 


Trp 


Leu 


Arg 


Ser 






595 










600 










605 








Asp 


Ser 


Lys 


Ser 


Gin 


Tro 


Arg 


His 


Ala 


Ser 


Ala 


Ser 


Tyr 


Ser 


Met 


Ser 




610 










615 










620 










His 


Asp 


Leu 


Asn 


Gly 


Arg 


Met 


Thr 


Asn 


Leu 


Ala 


Gly 


Val 


Tyr 


Gly 


Thr 


625 










630 










635 












Leu 


Leu 


Glu 


Asp 


Asn 


Asn 


Leu 


Ser 


Tvr 

X y X 


Ser 


Val 


Gin 


Thr 


Gly 


lyx 


Alp 










645 










650 










\j ~j o 




Gly 


Gly 


Gly 

w -1- _y 


Asp 


Glv 


Asn 


Ser 


Gly 


Ser 


Thr 


Gly 


Tyr 


Ala 


Thr 


Leu 


Asn 








660 










665 










670 






Tyr 


Arg 


Gly 


Gly 


Tyr 


Gly 


Asn 


Ala 


Asn 


He 


Gly 


Tyr 


Ser 


His 


Ser 


Asp 






675 










680 










685 








Asp 


He 


Lys 


Gin 


Leu 


Tyr 


Tyr 


Gly 

J: 


Val 


Ser 


Gly 


Gly 


Val 


Leu 


Ala 


His 




690 










695 










700 










Ala 


Asn 


Gly 


Val 


Thr 


Leu 


Gly 


Gin 


Pro 


Leu 


Asn 


Asp 


Thr 


Val 


Val 


Leu 


705 










710 










715 










720 


Val 


Lys 


Ala 


Pro 


Gly 


Ala 


Lys 


Asp 


Ala 


Lys 


Val 


Glu 


Asn 


Gin 


Thr 


Gly 










725 










730 










735 




Val 


Arg 


Thr 


Asp 


Trp 


Arg 


Gly 


Tyr 


Ala 


Val 


Leu 


Pro 


Tyr Ala 


Thr 


Glu 








740 










745 










750 
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Tyr 

Val 

He 
785 
Thr 

Ser 

Tyr 

Glu 

Ser 
865 



Arg 

Asp 
770 
Val 

Leu 

Glu 

Leu 

Glu 
850 
Gin 



Glu 
755 
Leu 

Arg 

Thr 

Ser 

Ser 
835 
Glu 

Gin 



Asn Arg Val Ala 
Asp Asn 
Ala Glu 



His Asn 
805 
Ser Gin 
820 

Gly Met 
Asn Ala 
Gin Leu 



Ala Val 
775 
Phe Lys 
790 

Asn Lys 



Ser Ser 

Pro Leu 

His Cys 
855 
Leu Thr 
870 



Leu 
760 
Ala 

Ala 

Pro 

Gly 

Ala 
840 
Val 

Gin 



Asp 

Asn 

Arg 

Leu 

He 
825 
Gly 

Ala 

Leu 



Thr Asn 

Val Val 

Val Gly 
795 
Pro Phe 
810 

Val Ala 

Lys Val 

Asn Tyr 

Ser Ala 
875 



Thr 

Pro 
780 
He 

Gly 

Asp 

Gin 

Gin 
860 
Glu 



Leu 
765 
Thr 

Lys 

Ala 

Asn 

Val 
845 
Leu 

Cys 



Ala Asp Asn 

Arg Gly Ala 

Leu Leu Met 
800 

Met Val Thr 
815 

Gly Gin Val 
830 

Lys Trp Gly 
Pro Pro Glu 
Arg 



<210> 365 
<211> 176 
<212> PRT 
<213> E. Coli 



<400> 365 

Met Arg Asn Lys Pro Phe Tyr Leu Leu Cys Ala Phe Leu Trp Leu Ala 

15 10 15 

Val Ser His Ala Leu Ala Ala Asp Ser Thr He Thr He Arg Gly Tyr 

20 25 30 

Val Arg Asp Asn Gly Cys Ser Val Ala Ala Glu Ser Thr Asn Phe Thr 

35 40 45 

Val Asp Leu Met Glu Asn Ala Ala Lys Gin Phe Asn Asn He Gly Ala 

50 55 60 

Thr Thr Pro Val Val Pro Phe Arg He Leu Leu Ser Pro Cys Gly Asn 
65 70 75 80 

Ala Val Ser Ala Val Lys Val Gly Phe Thr Gly Val Ala Asp Ser His 

85 90 95 

Asn Ala Asn Leu Leu Ala Leu Glu Asn Thr Val Ser Ala Ala Ser Gly 

100 105 110 

Leu Gly He Gin Leu Leu Asn Glu Gin Gin Asn Gin He Pro Leu Asn 

115 120 125 

Ala Pro Ser Ser Ala Leu Ser Trp Thr Thr Leu Thr Pro Gly Lys Pro 

130 135 140 

Asn Thr Leu Asn Phe Tyr Ala Arg Leu Met Ala Thr Gin Val Pro Val 
145 150 155 160 

Thr Ala Gly His He Asn Ala Thr Ala Thr Phe Thr Leu Glu Tyr Gin 
165 170 175 



<210> 366 
<211> 167 
<212> PRT 
<213> E. Coli 



<400> 366 

Met Lys Trp Cys Lys Arg Gly Tyr Val Leu Ala Ala He Leu Ala Leu 
15 10 15 
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Ala 


Ser 


Ala 


Thr 


He 


Gin 


Ala 


Ala 


Asp 


Val 


Thr 


He 


Thr 


Val 


Asn 


Gly 








20 










25 










30 






Lvs 

J: 


Val 


Val 


Ala 


Lvs 

-1-1 Jf w 


Pro 


Cys 


Thr 


Val 


Ser 


Thr 


Thr 


Asn 


Ala 


Thr 


Val 






35 










40 










45 








Asp 


Leu 


Glv 


Asp 


Leu 


Tvr 


Ser 


Phe 


Ser 


Leu 


Met 


Ser 


Ala 


Gly Ala Ala 




50 










55 










60 










Ser 


Ala 


Trp 


His 


Asp 


Val 


Ala 


Leu 


Glu 


Leu 


Thr 


Asn 


Cys 


Pro 


Val 


Gly 


65 










70 










75 










80 


Thr 


Ser 


Arg 


Val 


Thr 


Ala 


Ser 


Phe 


Ser 


Gly 


Ala 


Ala 


Asp 


Ser 


Thr 


Gly 










85 










90 










95 




Tvr 


Tvr 


Lys 


Asn 


Gin 


Gly 


Thr 


Ala 


Gin 


Asn 


He 


Gin 


Leu 


Glu 


Leu 


Gin 








100 










105 










110 






Asp 


Asp 


Ser 


Gly 


Asn 


Thr 


Leu 


Asn 


Thr 


Gly 


Ala 


Thr 


-i-!_y o 


Thr 


Val 


Gin 






115 










120 










125 








Val 


Asp 


Asp 


Ser 


Ser 


Gin 


Ser 


Ala 


His 


Phe 


Pro 


Leu 


Gin 


Val Arg Ala 




130 










135 










140 










Leu 


Thr 


Val 


Asn 


Gly 


Gly 


Ala 


Thr 


Gin 


Gly 


Thr 


He 


Gin 


Ala 


Val 


He 


145 










150 










X -J -J 










160 


Ser 


He 


Thr 


i yx 


J- J.1JL 


lyx 


O t; -L 




























J- U 


























<210> 


367 




























<211> 


300 




























<212> 


PRT 




























<213> 


E. Coli 


























<400> 


367 


























Met 


Lys 


Arg 


Val 


He 


Thr 


Leu 


IT iiC 




V Cl _L 


Leu 


Leu 


iYie I- 


Cnly 


Trp 


Ser 


1 








5 




















lo 




Val 


Asn 


Ala 


Trp 


Ser 


Phe 


Ala 


Cys 


Lys 


Thr 


Ala 




\j J- y 


i nr 


Ala 


lie 








20 










9 S 










J u 






Pro 


He 


Gly 


Gly 


Gly 


Ser 


Ala 


Asn 


Val 


T VT 


Val 


Asn 


Jut; U. 


Ala 


Pro 


Val 






35 










4 0 


















Val 


Asn 


Val 


Gly 


Gin 


Asn 


Leu 


Val 


Val 


Asp 


±J t- Li 




1 ILI. 


Gin 


He 


Phe 




50 










55 










o u 










Cys 


His 


Asn 


Asp 


Tyr 


Pro 


Glu 


Thr 


He 


Thr 


Asp 




V d -L 


Thr 


Leu 


Gin 


65 










70 




















80 


Arg 


Gly 


Ser 


Ala 


Tyr 


Gly 


Gly 


Val 


Leu 


Ser 


Asn 


Phe 


Ser 


Gly 


Thr 


Val 










85 










90 










95 




Lvs 


Tyr 


Ser 


Gly 


Ser 


Ser 


Tvr 


Pro 


Phe 


Pro 


Thr 


Thr 


Ser 


Glu 


Thr 


Pro 








100 










105 










110 






Arg 


Val 


Val 


Tyr 


Asn 


Ser 


Arg 


Thr 


Asp 


Lys 


Pro 


LLP 


Pto 


Val 


Ala 


Leu 






115 










120 










± Z. J 








Tvr 
J- J ->- 


Leu 


Thr 


Pro 


Val 


Ser 


Ser 


Ala 


Gly 


Gly 


Val 


Ala 


He 


Lys 


Ala 


Gly 




130 










135 










1 4 n 










Ser 


Leu 


He 


Ala 


Val 


Leu 


He 


Leu 


Arg 


Gin 


Thr 


Asn 


A c;ti 


Tyr 


Asn 


Ser 


145 










150 










J. .J o 










160 


Asp 


Asp 


Phe 


Gin 


Phe 


Val 


Trn 


Asn 


He 


1 yx 


Ala 


Asn 


A cin 


Asp 


Val 


Val 










165 










J. / u 










175 




Val 


Pro 


Thr 


Gly 


Gly 


Cys 


Asp 


Val 


Ser 


Ala 


Arg 


Asp 


Val 


Thr 


Val 


Thr 








180 










185 










190 






Leu 


Pro 


Asp 


Tyr 


Pro 


Gly 


Ser 


Val 


Pro 


He 


Pro 


Leu 


Thr 


Val 


Tyr 


Cys 






195 










200 










205 






Ala 


Lys 


Ser 


Gin 


Asn 


Leu 


Gly 


Tyr 


Tyr 


Leu 


Ser 


Gly 


Thr 


Thr 


Ala 


Asp 




210 










215 










220 











Ala Gly Asn Ser lie Phe Thr Asn Thr Ala Ser Phe Ser Pro Ala Gin 
225 230 235 240 

Gly Val Gly Val Gin Leu Thr Arg Asn Gly Thr He He Pro Ala Asn 

245 250 255 

Asn Thr Val Ser Leu Gly Ala Val Gly Thr Ser Ala Val Ser Leu Gly 

260 265 270 

Leu Thr Ala Asn Tyr Ala Arg Thr Gly Gly Gin Val Thr Ala Gly Asn 

275 280 285 

Val Gin Ser He He Gly Val Thr Phe Val Tyr Gin 
290 295 300 



<210> 
<211> 
<212> 
<213> 



368 
521 
PRT 

E. Coll 



Met 
1 

Thr 

Thr 

Ala 

Gly 

65 

Leu 

Ser 

Tyr 

Gly 

Ser 
145 
Val 

Cys 

Tyr 

Ala 

Thr 
225 
Ala 

Lys 

Ala 

Ala 



<400> 
Leu Ser 

He Arg 

Thr Gly 
35 

Leu Gin 
50 

Val Leu 

Tyr Phe 

Asn Asp 

Gly Asn 
115 
Cys Arg 
130 

Gin Lys 

Arg Lys 

Gin Ala 

Gly Leu 
195 
Pro Gin 
210 

Met Ala 

Thr Val 

Glu Lys 

Trp Val 
275 
Gly He 
290 



368 

Lys Leu Pro 
5 

Val His Arg 
20 

Val Ala Tyr 

Leu Leu Asp 

Gly Ser Leu 
70 

Lys He Lys 
85 

Ser Arg Leu 
100 

Cys Ser Leu 

His Gly Pro 

Glu Arg Pro 
150 

Ser Val Leu 
165 

Arg Ala Ser 
180 

Ser Ala Ala 

He Leu Asp 

Tyr Arg Asp 
230 

Val Asp Gly 
245 

Cys Asp Gin 
260 

Glu Arg Asn 
Ala Ser Phe 



Arg Arg 

Gly Glu 

Gly Thr 
40 

Lys Val 
55 

Val Phe 

Glu Asp 

Thr Gly 

Thr Gly 
120 
Cys Arg 
135 

Ala Ser 

Ser Ala 

Ser Ser 

Val Leu 
200 
Gin Phe 

215 

Gly Ser 

Lys Thr 

Val Asn 

He Lys 
280 
Cys Pro 
295 



Leu Arg 
10 

Asp Met 
25 

Ser Ala 

Thr Pro 

Gly Leu 

Arg Arg 
90 

Cys Glu 
105 

Gin Arg 

Ser Cys 

Leu Pro 

Ala Ser 
170 
He Ser 
185 

Ala Leu 

Leu Asp 

Gly He 

Val Phe 
250 
Ala He 
265 

Val Pro 
Tyr Asn 



Ser Phe Gin 

Lys Ser Met 

Gly Asn Ala 
45 

Ser Gin Trp 
60 

Leu Thr Tyr 
75 

Lys Ala Ala 

Arg Ser Pro 

Thr Leu Arg 
125 

Ala Gly Val 

140 
Gly Ser Ser 
155 

Val Leu Leu 

Met Asn Thr 

He Gly Ala 
205 

Glu Lys Glu 

220 
Trp Thr He 
235 

Pro Asn Met 

Glu Arg Asp 

Leu Thr Glu 
285 

He Gly Pro 
300 



Thr Tyr Cys 
15 

Asp Lys Leu 
30 

Gly Phe Trp 

Ala Ala He 

Leu Thr Asn 
80 

Arg Gly Glu 
95 

Phe Glu Ser 
110 

Asn Phe Pro 

Leu Gly Ser 

Arg Lys He 
160 

Asp Lys Ser 
175 

Lys He Arg 
190 

Gly Ala Ser 

Gly Asn His 

Cys Arg Gly 
240 

Lys Leu Ser 

255 
Lys Ala Leu 
270 

Pro Gin Lys 
Gly Lys Cys 
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Phe 
305 
Ala 

Arg 

Glu 

Phe 

Glu 
385 
Lys 

Val 

Pro 

Gin 

Leu 
465 
Ser 

Lys 

Asn 



Pro 

Cys 

lie 

Ser 

Phe 
370 
Asp 

Arg 

Met 

Gly 

Lys 
450 
Pro 

Pro 

Leu 

Leu 



Ser Thr Phe 
Glu 
Arg 



Ala 



Ala 
355 
Ser 

Gly 

Met 

Arg 

Leu 
435 
Asn 

Leu 

Thr 

Leu 

Glu 
515 



Ser 
340 
Leu 

Cys 

Lys 

Leu 

Leu 
420 
Leu 

Met 

Pro 

Cys 

Thr 
500 
He 



He 
325 
Asn 



Thr 

Cys 

Lys 

Asn 
405 
Leu 

He 

Pro 

Ser 

Ser 
485 
Asn 

Lys 



Tyr 
310 
Arg 

Asn 

Cys 

Gin 

Asn 
390 
Leu 

Lys 

Cys 

Tyr 

Glu 
470 
Glu 

Ala 

Ser 



Lys Arg 

Trp Trp 

Cys Tyr 

Trp Gly 
360 
Asp Leu 
375 

Gly Arg 

Leu Leu 

Met Arg 

Leu He 
440 
Ala Leu 
455 

Ala Lys 

He Leu 

Thr Ser 

He Lys 
520 



Leu Asn Ala 
315 

He Lys Asp 

330 
Gly Gin Val 
345 

He Glu Gin 

Ser Ser Glu 

Asn Val Met 
395 

Glu Leu Asn 
410 

Asn Arg Asn 
425 

Val Leu Thr 

Thr Gin Arg 

Gin Pro Lys 
475 

Gin Gin Lys 
490 

Gin Glu Leu 

505 

Cys 



Gly Asp Arg 

Gly Gly Arg 

He Arg Arg 
350 

He Arg Tyr 
365 

Met Ser Gly 
380 

Leu Pro His 

Arg Gly Glu 

Leu Leu Lys 
430 

Ser Cys Val 

445 
Ser He Pro 
460 

Pro Pro Lys 

Leu Ser Phe 

Val Asn Arg 
510 



Lys Gly 
320 
Asp Cys 
335 

Asp Gin 

Ser Trp 

Ala Thr 

Tyr His 
400 
Leu Pro 
415 

Phe Leu 

Pro Lys 

Gin He 

Glu Cys 
480 
Met Leu 
495 

Ser Met 



<210> 
<211> 
<212> 
<213> 



369 
177 
PRT 

E. Coli 



Met 
1 

He 

Glu 

Trp 

Pro 

65 

Glu 

Leu 

He 

Ala 

Asp 
145 



<400> 
Asn Thr 

Gly Ala 

Lys Glu 

35 
Thr He 
50 

Asn Met 

Arg Asp 

Thr Glu 

Gly Pro 
115 
Gly Asp 
130 

Gly Gly 



369 

Lys He 

5 

Gly Ala 
20 

Gly Asn 

Cys Arg 

Lys Leu 

Lys Ala 

85 
Pro Gin 
100 

Gly Lys 
Arg Lys 
Arg Asp 



Arg Tyr Gly Leu 
Ser Ala 
His Thr 



Gly Ala 

55 
Ser Lys 
70 

Leu Ala 

Lys Ala 

Cys Phe 

Gly Ala 
135 
Cys Arg 
150 



Pro Gin 

25 
Met Ala 
40 

Thr Val 



Glu Lys 

Trp Val 

Gly He 
105 
Pro Ser 
120 

Cys Glu 
He Arg 



Ser Ala Ala 
10 

He Leu Asp 

Tyr Arg Asp 

Val Asp Gly 
60 

Cys Asp Gin 
75 

Glu Arg Asn 
90 

Ala Ser Phe 

Thr Phe Tyr 

Ala He Arg 
140 

Ser Asn Asn 
155 



Val Leu 

Gin Phe 
30 

Gly Ser 
45 

Lys Thr 

Val Asn 

He Lys 

Cys Pro 
110 
Lys Arg 
125 

Trp Trp 
Cys Tyr 



Ala Leu 
15 

Leu Asp 

Gly He 

Val Phe 

Ala He 
80 

Val Pro 
95 

Tyr Asn 

Leu Asn 

He Lys 

Gly Gin 
160 



-198- 



Val lie Arg Arg Asp Gin Glu Ser Ala Leu Thr Cys Trp Gly lie Glu 
165 170 175 

Gin 



<210> 370 
<211> 103 
<212> PRT 
<213> E. Coll 



<400> 370 
Met Thr Gin Asp 
1 

Asn Lys Val Thr 
20 

Gly Glu He Phe 
35 

Ala Leu Glu Phe 
50 

Phe Leu Glu Val 
65 

Gin His Gin Tyr 

Arg Pro His His 
100 



Tyr Glu Leu Val 
5 

Val Thr Val Ala 

Asp Leu Asp Val 
40 

Tyr Glu Ala Ala 
55 

Ala Glu Lys Leu 
70 

Ser Phe Lys He 
85 

Lys Tyr Leu 



Val Lys Gly Val 
10 

Leu Gin Asp Lys 
25 

Ala Met Asp Arg 

Ala Arg Arg Ser 
60 

Ser Glu Lys Val 
75 

Glu Asn Pro Ala 
90 



Arg Asn Phe Glu 
15 

Glu Arg Phe Asp 
30 

Val Glu Gly Ala 
45 

Val Arg Gin Val 

Glu Ser Tyr Leu 
80 

Asn Lys His Glu 
95 



<210> 371 
<211> 96 
<212> PRT 
<213> E. Coli 



<400> 371 
Met Leu Ser Lys 
1 

Thr He Arg Val 
20 

Thr Thr Gly Val 
35 

Ala Leu Gin Leu 
50 

Gly Val Leu Gly 
65 

Leu Tyr Phe Lys 



Leu Pro Arg Arg 
5 

His Arg Gly Glu 

Ala Tyr Gly Thr 
40 

Leu Asp Lys Val 
55 

Ser Leu Val Phe 
70 

He Lys Glu Asp 
85 



Leu Arg Ser Phe 
10 

Asp Met Lys Ser 
25 

Ser Ala Gly Asn 

Thr Pro Ser Gin 
60 

Gly Leu Leu Thr 
75 

Arg Arg Lys Ala 
90 



Gin Thr Tyr Cys 
15 

Met Asp Lys Leu 
30 

Ala Gly Phe Trp 
45 

Trp Ala Ala He 

Tyr Leu Thr Asn 
80 

Ala Arg Gly Glu 
95 



<210> 372 
<211> 71 
<212> PRT 
<213> E. Coli 



<400> 372 

Met Ser Asn Lys Met Thr Gly Leu Val Lys Trp Phe Asn Ala Asp Lys 



15 10 15 

Gly Phe Gly Phe He Ser Pro Val Asp Gly Ser Lys Asp Val Phe Val 

20 25 30 

His Phe Ser Ala He Gin Asn Asp Asn Tyr Arg Thr Leu Phe Glu Gly 

35 40 45 

Gin Lys Val Thr Phe Ser He Glu Ser Gly Ala Lys Gly Pro Ala Ala 

50 55 60 

Ala Asn Val He He Thr Asp 
65 70 



<210> 
<211> 
<212> 
<213> 



373 
338 
PRT 

E. Coli 



Met 
1 

Leu 

Lys 

Met 

Arg 

65 

Trp 

Ala 

Asp 

Leu 

Gly 
145 
Ala 

His 

He 

Ala 

Glu 
225 
Arg 

Pro 

Ala 

His 

Tyr 



<400> 
Phe Val 

Thr Phe 

Glu He 

35 
Val Glu 
50 

Pro Tyr 

Tyr Asn 

Ser Met 

Arg Thr 
115 
Ala Arg 
130 

Val Met 

Pro Ser 

Gin Thr 

Gly Val 
195 
Ala Asn 
210 

Glu Gin 

Glu Glu 

Gly Lys 

He Asn 
275 
Pro Phe 
290 

Lys Gly 



373 

He Trp 

5 

Ala Asp 
20 

Phe Leu 

Val He 

Pro Leu 

Leu Ser 
85 

Arg Leu 
100 

Thr He 

Gin Leu 

Met Thr 

Ser Thr 
165 
Lys Lys 
180 

Asp Ala 

Glu His 

Phe Val 

Leu Ala 
245 
Val Arg 
260 

He Glu 
Arg He 
Leu Leu 



Ser His Arg Thr 
Ser Glu 
Ser Arg 



Glu Pro 

55 
Glu Thr 
70 

Asp Gly 

Phe Ala 

Met Asn 

Phe Lys 
135 
Gin Gly 
150 

Lys Asn 

Gly Asn 

Lys Ser 

Asp Leu 
215 
Ser Ala 
230 

Glu Val 

Thr Leu 

Tyr Met 

He Lys 
295 
Lys Asn 



Phe Ser 

25 
Met Glu 
40 

Phe Tyr 



Met Leu 

Ala Met 

Arg Leu 
105 
Phe Arg 
120 

Thr He 

Thr Leu 

Lys Glu 

Gin Trp 
185 
Gly Leu 
200 

Asn Gin 

Asp Ala 

Asp Val 

Lys Gin 
265 
Lys Ala 
280 

Arg Gin 



Gly Phe 
10 

Ser Lys 

Gin He 

Pro Lys 

Arg He 
75 

Glu Asp 
90 

Ser Leu 

His Leu 

Asn Arg 

Val Asp 
155 
Gin Gin 
170 

His Phe 

Thr His 

Leu Gly 

Gly Tyr 
235 
Asp Trp 
250 

His Pro 



Ser He 
Phe Gly 
Asp Asn Gin Leu 



He Met Ser 

Arg Arg Gin 
30 

Leu Pro Trp 
45 

Ala Gly Asn 
60 

His Cys Met 

Ala Leu Tyr 

Asp Ser Ala 
110 

Leu Glu Gin 
125 

Trp Leu Ala 
140 

Ala Thr He 

Arg Asp Pro 

Gly Met Lys 
190 

Ser Leu Val 

205 
Asn Leu Leu 
220 

Gin Gly Ala 

Leu He Ala 

Arg Lys Asn 
270 

Arg Ala Arg 

285 
Phe Val Lys 
300 

Ala Met Leu 



His Gin 
15 

Thr Arg 
Gin Asn 
Gly Arg 

Gin His 

80 

Glu He 
95 

Leu Pro 

His Gin 

Glu Ala 

He Glu 
160 
Glu Met 
175 

Ala His 

Thr Thr 

His Gly 

Pro Gin 
240 
Glu Arg 
255 

Lys Thr 
Val Glu 
Ala Arg 
Phe Thr 
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305 310 315 320 

Leu Ala Asn Leu Phe Arg Ala Asp Gin Met lie Arg Gin Trp Glu Arg 
325 330 335 

Ser His 



<210> 
<211> 
<212> 
<213> 



374 
157 
PRT 

E. Coll 



Met 

1 
Lys 

Val 

Tyr 

Asn 

65 

Asp 

Asp 

Phe 

Tyr 

Met 
145 



<400> 
Val Tyr 

Leu Leu 

Arg Asp 
35 

Ala Gly 
50 

Asn Asn 

Asp Asp 

Asp Leu 

Ser Thr 
115 
Ser Val 
130 

Leu Gly 



374 

He He 
5 

Glu Asn 
20 

Asn Lys 

Leu Asp 

He Ala 

Tyr He 
85 

Leu Thr 
100 

Leu Cys 
Arg Lys 
He Lys 



He Val Ser 

Leu Asn Ala 

Asp Ser Leu 
40 

Tyr He Ser 
55 

Val Ala Tyr 
70 

Leu Phe Leu 

Tyr He Lys 

Leu Phe Arg 
120 

Phe Pro Val 
135 

Glu Gly Ala 
150 



His Gly His 
10 

Asp Asp Glu 
25 

Leu Leu Lys 
Gly Gly Val 

Val Lys Glu 

75 

Asn Pro Asp 
90 

Tyr Val Glu 
105 

Asp Glu Ala 

Leu Ser Asp 

Asn Lys Ser 
155 



Glu Asp Tyr 

His Tyr Lys 
30 

Gin He Cys 
45 

Tyr Gly Phe 
60 

Lys Tyr Arg 

He He Met 

Ser Lys Arg 
110 

Lys Ser Leu 

125 
Phe He Val 
140 

Leu He 



He Lys 
15 

He He 

Gin His 

Gly His 

Pro Ala 

80 
Lys His 
95 

Tyr Ala 
His Asp 
Ser Phe 



<210> 
<211> 
<212> 
<213> 



375 
372 
PRT 

E. Coll 



Met 
1 

Gly 

Glu 

Glu 

Ser 

65 

Ser 



<400> 
Gly Lys 

Pro Phe 

Asn Val 

35 
Ser Tyr 
50 

Trp Leu 
Lys Glu 
Asn Val 



Ala 
Pro Phe Tyr 



375 

Ser He Val 
5 

Thr He Leu 
20 

Ser Phe He 

Pro Trp Val 

Lys Arg Leu 
70 

Leu Asn Ala 
85 

Val Thr Lys 
100 

Lys Gly He 



Val Val Ser 

Lys Lys Phe 
25 

Ala Leu Val 
40 

Lys Phe He 
55 

His Phe Glu 

Thr His Trp 

Lys Arg Tyr 
105 

Leu Phe Arg 



Ala Val Asn 
10 

Leu Ala Ala 

His Ser Ala 

Glu Phe Pro 
60 

Tyr Val Val 
75 

He Cys Leu 
90 

Val Tyr Cys 
Glu He Leu 



Phe Thr 

Thr Asn 

30 
Lys Glu 
45 

Glu Val 

Cys Lys 

His Asp 

His Asn 
110 
Met Glu 



Thr Gly 
15 

Asn Lys 

Leu Lys 

Lys Gly 

Lys Leu 
80 

He Thr 
95 

Pro Ala 
Pro Ser 
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Phe 

Lys 
145 
Phe 

He 

Lys 

Val 

Glu 
225 
Ala 

His 

He 

Pro 

Phe 
305 
Phe 

Lys 

Arg 

Thr 



Phe 
130 
Lys 

He 

Lys 

Asn 

Phe 
210 
Gin 

Tyr 

Phe 

Ser 

Leu 
290 
Pro 

Asp 

Lys 

Asn 

Glu 
370 



115 
Leu 

Asn 

Lys 

Leu 

Asn 
195 
Lys 

Ser 

Ala 

Leu 

Asp 
275 
Ser 

Phe 

Ser 

Gly 

Glu 
355 
Glu 



Phe Lys Met 
Thr 
Lys 



Ser 
180 
Pro 

Asn 

Asn 

Lys 

Gly 
260 
He 

Glu 

Thr 

Asn 

Asn 
340 
Asn 

His 



Ala Val 
150 
Tyr Ser 
165 

Asp Lys 



Ser Glu 

Tyr Glu 

He Lys 
230 
Tyr He 
245 

Tyr Leu 

Val Cys 

Ala Lys 

Arg Glu 
310 
Asn Asp 
325 

Leu Lys 



Leu 
135 
Phe 

He 

Ser 

Leu 

Leu 
215 
Phe 

He 

Asp 

Phe 

Glu 
295 
Thr 



Asp 
Lys 
Val Leu Val 



120 

Tyr Gly 

Val Gin 

Asn Asn 

Gin Leu 
185 
Thr He 
200 

He He 

Leu Leu 

Ser Leu 

Lys Glu 
265 
Pro Ser 
280 

Arg Gly 

Leu Gly 

Met Leu 

Asp He 
345 
Gly Phe 
360 



Leu He 

Gin Phe 
155 
He He 
170 

Thr Asp 

Phe Tyr 

Ser Ala 

Thr He 
235 
Ala Glu 
250 

Lys He 

Arg Leu 

Lys Trp 

Ser Tyr 
315 
Val Lys 
330 

Ser Asp 



Tyr 
140 
Trp 

Val 

Asp 

Pro 

Ala 
220 
Ser 

Gly 

Asp 

Glu 

Val 
300 
Glu 



Leu 
Ala 
Asp Glu Leu 



125 

Lys 

Met 

Ser 

Asp 

Ala 
205 
Arg 

Gly 

Leu 

His 

Thr 
285 
Leu 

Lys 

He 

Asn 

Val 
365 



He Asn He 

Lys Glu Lys 
160 

Arg Pro Glu 

175 
Ser Gin Phe 
190 

Val Pro Arg 

Lys Leu Lys 

Thr Glu Asn 
240 

Asp Asn Val 

255 
Cys Tyr Asn 
270 

Trp Gly Leu 

Ala Ser Asp 

Lys Ala Phe 
320 

He Asp Phe 

335 
Phe He Tyr 
350 

Asn Phe He 



<210> 
<211> 
<212> 
<213> 



376 
196 
PRT 

E. Coll 



<400> 
Met He Leu 
1 

Leu Val Arg 

He He Arg 
35 

Gly Glu Asn 
50 

Arg Gly Val 
65 

His He Ala 

Ala Ser Lys 

Ser Asp Pro 
115 



376 

Lys Leu Ala 
5 

Asp Val Leu 
20 

Phe Pro Cys 

Phe Thr Ser 

He Phe Phe 
70 

Ser He Glu 
85 

Val Phe He 
100 

Met Ser Ser 



Lys Arg 

Leu Thr 

Tyr He 

40 
Gly Val 
55 

Ser Asp 

Ser Val 

Thr Asp 

Pro Asn 
120 



Tyr Gly 
10 

Arg Val 
25 

Arg Asn 

Gly Leu 

Asn Val 

Thr He 
90 

His Asn 
105 

He Pro 



Leu Cys Gly Phe 
Phe Tyr 
Asp Gly 



Arg Leu 
60 

Gin Val 
75 

Gly Arg 
His Gly 
Pro Asp 



Arg Asn 

30 
Ser He 
45 

Asp Ala 



Asn Asp 

Asp Thr 

Ser Phe 
110 
Met Arg 
125 



He Arg 
15 

Cys Arg 

Asn Phe 

Phe Gly 

Tyr Val 

80 
Leu He 
95 

Lys His 
Thr Leu 
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Glu Ser Ser Ala 
130 

Val Thr Val Leu 
145 

Ala Asn Ser Val 

Gly Val Pro Ala 
180 

Trp Glu Lys Ala 
195 



Val Val He Gly 
135 

Pro Gly Thr He 
150 

Val Arg Gly Ser 
165 

Lys He He Lys 



Gin Arg Val Trp 
140 

He Gly Asn Gly 
155 

He Pro Glu Asn 
170 

Lys Tyr Asn His 
185 



Leu Gly Glu Asn 

Val Val Val Gly 
160 

Thr Val He Ala 
175 

Glu Thr Lys Leu 
190 



<210> 377 
<211> 330 
<212> PRT 
<213> E, Coll 



<400> 
Met Tyr Phe 
1 

Lys Ala Arg 

Ser Val Val 
35 

Ser Ser Val 
50 

Val Leu He 
65 

Ser Phe Phe 

Asp He Asp 

Ala Thr Cys 
115 

Leu Ser Lys 
130 

Asp Tyr Leu 
145 

Gin Arg Gly 

Phe He Tyr 

Glu Asn Lys 
195 

Pro Glu Lys 

210 
Gly Asp Ser 
225 

Phe Asn Asn 

Val Phe He 

Arg He Gly 
275 

Asp Ser Met 

290 
He He Ser 
305 



377 

Leu Asn Asp 
5 

Lys Asp Ala 
20 

Asn He Pro 
Lys Leu Ser 

Phe Asn Phe 

70 

His Arg Leu 
85 

Glu Leu Arg 
100 

Asp Met Val 

Tyr Met Ser 

Val Ser Ser 
150 

Val He Tyr 
165 

Thr Glu Gly 
180 

Asp Asn Pro 

He Asn Leu 

Val Glu Thr 
230 

Pro His Lys 

245 
Trp Asp Lys 
260 

Tyr Ala Val 

Thr He Glu 

Gin Lys He 
310 



Leu Asn 

Leu Asp 

Leu Trp 

40 
Thr Phe 
55 

Pro Met 

Leu Lys 

Gly Gly 

He Ser 
120 
Gin Asp 
135 

Asp Val 

Ala Gly 

Cys Asp 

Lys Tyr 
200 
Pro Gly 
215 

Cys Ser 

Thr Ser 

Ala Ala 

Gly Ser 
280 
Thr Tyr 
295 

Arg Thr 



Phe Ser Arg 
10 

He Ala Ser 
25 

Gly Gly Val 

Leu Cys Gly 

Ala Lys Pro 
75 

Phe Arg He 
90 

Gly Gly Ser 
105 

His Asn Pro 

Lys He Lys 

Glu His Arg 
155 

Asn Leu Ser 
170 

Phe Thr Leu 
185 

Leu Gly Ser 

Met Gin Phe 

Gly Ala Phe 
235 

Leu Tyr Leu 

250 
Leu Ala Asp 
265 

He Lys Glu 

Lys Gin He 

Gly Ser Tyr 
315 



Arg Asp 

Asp Tyr 

Val Gin 
45 

Leu Glu 
60 

Phe Trp 

Val Pro 

Asp Ser 

Gin Met 
125 
Asp He 
140 

Asp Val 

Arg His 

Phe Gly 

Phe Asp 
205 
Gly Leu 
220 

Gly Asp 

Ser Met 

Phe He 

Met Gin 
285 
Ser Glu 
300 

Phe Arg 



Ala Gly Phe 
15 

Glu Asn He 
30 

Arg He He 

Asn Lys Asp 

His He Leu 
80 

Leu He His 
95 

Val Arg Leu 
110 

Thr Lys Tyr 

Lys He Phe 

Thr Asp Lys 
160 

Lys Cys Ser 
175 

Val Asn Tyr 
190 

Ala Gin Ser 

He Trp Asp 

Tyr Leu Lys 
240 

Glu Leu Pro 
255 

Val Asp Asn 
270 

Glu He Val 

Asn Thr Lys 

Asp Val Leu 
320 
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Glu 



Glu Val 



lie Asp 
325 



Asp 



Leu 



Lys 



Thr 



Arg 
330 



<210> 378 
<211> 388 
<212> PRT 
<213> Coli 

<400> 378 

Met lie Tyr Leu Val lie Ser Val Phe Leu lie Thr Ala Phe lie Cys 

15 10 15 

Leu Tyr Leu Lys Lys Asp lie Phe Tyr Pro Ala Val Cys Val Asn lie 

20 25 30 

lie Phe Ala Leu Val Leu Leu Gly Tyr Glu lie Thr Ser Asp lie Tyr 

35 40 45 

Ala Phe Gin Leu Asn Asp Ala Thr Leu lie Phe Leu Leu Cys Asn Val 

50 55 60 

Leu Thr Phe Thr Leu Ser Cys Leu Leu Thr Glu Ser Val Leu Asp Leu 
65 70 75 80 

Asn lie Arg Lys Val Asn Asn Ala lie Tyr Ser lie Pro Ser Lys Lys 

85 90 95 

Val His Asn Val Gly Leu Leu Val lie Ser Phe Ser Met lie Tyr lie 

100 105 110 

Cys Met Arg Leu Ser Asn Tyr Gin Phe Gly Thr Ser Leu Leu Ser Tyr 

115 120 125 

Met Asn Leu lie Arg Asp Ala Asp Val Glu Asp Thr Ser Arg Asn Phe 

130 135 140 

Ser Ala Tyr Met Gin Pro He He Leu Thr Thr Phe Ala Leu Phe He 
145 150 155 160 

Trp Ser Lys Lys Phe Thr Asn Thr Lys Val Ser Lys Thr Phe Thr Leu 

165 170 175 

Leu Val Phe He Val Phe He Phe Ala He He Leu Asn Thr Gly Lys 

180 185 190 

Gin He Val Phe Met Val He He Ser Tyr Ala Phe He Val Gly Val 

195 200 205 

Asn Arg Val Lys His Tyr Val Tyr Leu He Thr Ala Val Gly Val Leu 

210 215 220 

Phe Ser Leu Tyr Met Leu Phe Leu Arg Gly Leu Pro Gly Gly Met Ala 
225 230 235 240 

Tyr Tyr Leu Ser Met Tyr Leu Val Ser Pro He He Ala Phe Gin Glu 

245 250 255 

Phe Tyr Phe Gin Gin Val Ser Asn Ser Ala Ser Ser His Val Phe Trp 

260 265 270 

Phe Phe Glu Arg Leu Met Gly Leu Leu Thr Gly Gly Val Ser Met Ser 

275 280 285 

Leu His Lys Glu Phe Val Trp Val Gly Leu Pro Thr Asn Val Tyr Thr 

290 295 300 

Ala Phe Ser Asp Tyr Val Tyr He Ser Ala Glu Leu Ser Tyr Leu Met 
305 310 315 320 

Met Val He His Gly Cys He Ser Gly Val Leu Trp Arg Leu Ser Arg 

325 330 335 

Asn Tyr He Ser Val Lys He Phe Tyr Ser Tyr Phe He Tyr Thr Phe 

340 345 350 

Ser Phe He Phe Tyr His Glu Ser Phe Met Thr Asn He Ser Ser Trp 

355 360 365 

He Gin He Thr Leu Cys He He Val Phe Ser Gin Phe Leu Lys Ala 
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370 

Gin Lys lie Lys 
385 



375 



380 



Met 
1 

Ala 

Arg 

Gin 

He 

65 

Ser 

Met 

Gin 

Glu 

Gin 
145 
Lys 

Phe 

Gly 

Leu 

His 
225 
Phe 

Glu 

Asn 

Glu 

Val 
305 
Leu 

Phe 

He 



<210> 
<211> 
<212> 
<213> 

<400> 
Tyr Asp 

Asn Glu 

Asn His 

35 
He His 
50 

Trp Asp 

Pro Leu 

Asn Thr 

Asn He 
115 
Asn Leu 
130 

Ala Leu 

Glu Leu 

Asp Asn 

Tyr Thr 
195 
Gly He 
210 

Arg He 

Gly Ala 

Phe Pro 

Val Pro 
275 
Thr Lys 
290 

Gly Asp 
Phe Lys 
Gly Gly 
Ser Ala 



379 
367 
PRT 

E. Coli 
379 

Tyr He 
5 

Leu Lys 
20 

He Gly 

Lys Tyr 

Tyr Val 

Ala He 

85 
Phe His 
100 

He Asn 

Glu Glu 

He Lys 

Pro Ala 
165 
Asn Tyr 
180 

Lys Leu 

Asp Phe 

He Tyr 

Leu Glu 
245 
Asn Phe 
260 

Tyr Thr 

His Thr 

Glu Pro 

Lys Tyr 
325 
Arg Leu 
340 

Ala Leu 



He Val Gly 

Lys Leu Asn 

Gly Asn Ala 
40 

Gly Ala His 
55 

Asn Asp Leu 
70 

Tyr Lys Asp 

Gin Met Trp 

Ala Gin Lys 
120 

Gin Ala He 
135 

Gly Tyr Thr 
150 

Phe He He 

Phe Ser Asp 

He Glu Lys 
200 

Leu Lys Asp 

215 
Thr Gly Pro 
230 

Tyr Arg Ser 

Gin Gly Asn 

Arg He He 
280 

Val Val Thr 

295 
Tyr Tyr Pro 
310 

Arg Glu Leu 
Ala Glu Tyr 
Tyr Gin Val 



Ser Gly 
10 

Lys Lys 
25 

Tyr Thr 

He Phe 

Val Glu 

Lys Leu 
90 

Gly Val 
105 

Lys Lys 

Ser Leu 

Glu Lys 

Lys Arg 
170 
Arg Tyr 
185 

Met Leu 

Lys Asp 

He Asp 

Leu Lys 
250 
Ala Val 
265 

Glu His 

Lys Glu 

Val Asn 

Ala Ser 
330 
Lys Tyr 
345 

Lys Asn 



Leu Phe Gly Ala 
Val Leu 
Glu Asp 



His Thr 
60 

Phe Asn 
75 

Phe Asn 

Lys Asp 

Tyr Gly 

Val Gly 
140 
Gin Trp 
155 

He Pro 

Gin Gly 

Glu Gly 

Ser Leu 
220 
Gin Tyr 
235 

Phe Glu 

He Asn 

Lys His 

Tyr Pro 
300 
Asp Asn 
315 

Arg Glu 
Tyr Asp 
He Met 



Val He 

30 
Cys Glu 
45 

Asn Asp 



Arg Phe 

Leu Pro 

Pro Gin 
110 
Asp Lys 
125 

Glu Asp 

Gly Arg 

Val Arg 

He Pro 
190 
Val Asp 
205 

Ala Ser 

Phe Asp 

Thr Glu 

Phe Thr 
270 
Phe Asp 
285 

Leu Glu 

Lys Asn 

Asp Lys 

Met His 
350 
Ser Thr 



Val Cys 
15 

Glu Lys 

Gly He 

Lys Tyr 

Thr Asn 
80 

Phe Asn 
95 

Glu Ala 

Val Pro 

Leu Tyr 

Ser Ala 
160 
Phe Thr 
175 

Val Gly 

Val Lys 

Lys Ala 

Tyr Arg 
240 
Arg His 
255 

Asp Ala 

Tyr Val 

Trp Lys 

Met Glu 
320 
Val He 
335 

Gin Val 
Asp 
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355 



360 



365 



<210> 
<211> 
<212> 
<213> 



380 
371 
PRT 

E. Coli 



<400> 
Met Phe Pro 
1 

Ala His Gly 

Ser Gly Leu 
35 

Leu Ala Gly 
50 

Pro Pro Gly 
65 

Pro His Leu 

Arg Glu lie 

Val Gly Val 
115 

lie Met Cys 
130 

Ala Gly Leu 
145 

Thr Gly He 

Ser Leu Pro 

Tyr Gin Thr 
195 

Met Ser Thr 

210 
Glu Thr Met 
225 

Asp Ala Thr 

Asp Thr Trp 

Arg Gin Leu 
275 

Gly Gly Asp 
290 

Ala He Thr 
305 

Leu Pro Asn 

Pro Leu Asn 

Asp Leu Ala 
355 

Val Ala Ala 
370 



380 

Lys He 
5 

Glu Glu 
20 

Arg Phe 

Met Phe 

Trp Trp 

Ala Trp 

85 
Tyr Asn 
100 

Met Gly 

Leu Asn 

Val Leu 

Thr Val 
165 
He He 
180 

Ala Thr 

Arg Asp 

Leu Arg 

Leu Leu 
245 
Gly His 
260 

Gin He 

Glu Phe 

Ala Met 

Thr Pro 
325 
Pro Gin 
340 

Leu Tyr 



Met Asn Asp 

Pro Pro Leu 

Ala Arg Arg 
40 

Leu Pro He 
55 

Trp Leu Val 
70 

Gin He Ala 

Leu Lys Thr 

Val Asn Val 
120 

Leu Met Gly 
135 

Met Val Val 
150 

Ser Phe Asn 

Val He Tyr 

Lys Leu Ala 
200 

Gly Met Thr 

215 
Asn Glu Phe 
230 

He He Asp 

Asp Val Gly 

Thr Leu Arg 
280 

Ala Val He 

295 
Leu Arg Val 
310 

Gin Val Thr 

Met Ser His 

Lys Ala Lys 
360 



Glu Asn 
10 

Thr Pro 
25 

Val Arg 

Ala Ser 

Leu Val 

Ser Arg 
90 

Asp Ala 
105 

Leu Pro 

Ala Gly 

Ser Cys 

Ser Ala 
170 
Pro Leu 
185 

Glu His 

Gly Val 

Asp Asn 

He Asp 
250 
Asp Glu 
265 

Gly Ser 

Met Ser 

His Glu 

Leu Arg 
330 
Tyr Arg 
345 

Lys Ala 



Phe Phe Lys Lys 
Gin Asn 
Leu Pro 



Thr Leu 
60 

Gly Trp 
75 

Ala Val 

Val Leu 

Ser Thr 

Gly Pro 
140 
Leu Val 
155 

Pro Leu 

Leu Phe 

Lys Arg 

Tyr Asn 
220 
Cys Arg 
235 

His Phe 

Ala He 

Asp Val 

Gly Thr 
300 
Gly Leu 
315 

He Ser 
Glu Trp 
Gly Arg 



Glu His 

30 
Arg Ala 
45 

Val Ser 



Ala Phe 

Asp Pro 

Ala Gly 
110 
Ala Met 
125 

Arg Leu 

Thr Leu 

Glu Trp 

Gly Trp 
190 
Arg Leu 
205 

Arg Arg 

Arg His 

Lys Ser 

Val Ala 
270 
He Gly 
285 

Pro Ala 

Asn Thr 

Val Gly 

Leu Lys 
350 
Asn Arg 
365 



Ala Ala 
15 

Gin Arg 

Val Gly 

His Pro 

Val Trp 

80 
Leu Ser 
95 

Met Trp 

Leu Met 

Phe Val 

Glu Leu 
160 
Trp Leu 
175 

Val Ser 

Gin Val 

His Trp 

Asn Arg 
240 
He Asn 
255 

Leu Thr 

Arg Phe 

Glu Ser 

Leu Arg 
320 
Val Ala 
335 

Ser Ala 
Thr Glu 
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<210> 
<211> 
<212> 
<213> 



381 
467 
PRT 

E. Coli 



<400> 
Met Asp Val 
1 

Lys Leu Glu 

Val Ser Gly 
35 

Asp Leu Val 
50 

Tyr Gly lie 
65 

Asn Gly Asp 

Asn Val Val 

Thr Gly Ala 
115 

Val Asn Asp 
130 

Asp Leu Gly 
145 

Val Leu Gin 
lie Pro Ser 

Gly Thr Arg 

195 

Ala Trp Val 

210 
lie Asn Tyr 
225 

Lys lie Asp 

Phe Ala Lys 

Ser Gin Thr 
275 

Phe Val Asp 

290 
Ala Thr Met 
305 

Ala Trp Gly 

Ala Gly Tyr 

Gly Asp Asp 
355 

Tyr Asp Ser 

370 
Thr Tyr Ser 
385 



381 

Asn Val 
5 

Leu Thr 
20 

Val Phe 

Asn Asp 

Val Ala 

Val Asp 

85 
Ala Ala 
100 

Gly Ala 

Val Asn 

Ala Tyr 

Gin Met 
165 
Ala Asn 
180 

Leu Thr 

Ser Tyr 

Asp Gin 

Gly Asn 
245 
Gly Asp 
260 

Ala Tyr 

Gly Ser 

Ser Asn 

Phe Gly 
325 
Val Thr 
340 

Tyr Gin 
Met Arg 
Glu Asp 



Asp Gin Phe Asp 
Ser Gly 
Asp lie 



Arg Thr 
55 

Met Asn 
70 

Asn Gly 

Thr Gly 

He Ala 

Ser Asn 
135 
Thr Tyr 
150 

Glu Leu 

Thr Asn 

Asn Ser 

Phe Gly 
215 
Asp Val 
230 

Asn Ala 

Met Asn 

He Tyr 

Leu Ser 
295 
Gly Thr 
310 

Leu Lys 

Pro Tyr 

Leu Ser 

Tyr Glu 
375 
Gin Ala 
390 



Asn He 

25 
His Ser 
40 

Trp Asp 



Ser Asp 

Thr Glu 

Asn Tyr 
105 
Asp Tyr 
120 

Ala Thr 

Gin Ala 

Thr Asp 

He Trp 
185 
Arg His 
200 

Gly Asn 

Asn Gly 

Lys Trp 

Asp Arg 
265 
Ser Ser 
280 

Tyr Ser 

Tyr Val 

Ala Gly 

Gly Ser 
345 
Asn Asp 
360 

Leu Gly 
Leu Thr 



Thr Glu Ala 
10 

Ala Asp His 

Ser Asp Tyr 

Thr Ser Lys 
60 

Gly His Leu 
75 

Leu Asp Asn 
90 

Lys Val Arg 

Lys Asp Lys 

Phe Ser Ala 
140 

Glu Gin Arg 
155 

Tyr Ala Asn 
170 

Asn Leu Glu 

Gly Leu Ala 

Phe Asn Gly 
220 

He Met Val 

235 
He Val Gly 
250 

Ser Gly Gin 

Ala His Phe 

His Phe Asn 
300 

Asp Gly Ser 

315 
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<400> 399 

aagguuaagc cucacgguuc auuaguaccg guuagcucaa cgcaucgcug cgcuuacaca 60 

cccggccuau caacgucguc gucuucaacg uuccuucagg acccuuaaag ggucagggag 120 

aacucaucuc ggggcaaguu ucgugcuuag augcuuucag cacuuaucuc uuccgcauuu 180 

agcuaccggg cagugccauu ggcaugacaa cccgaacacc agugaugcgu ccacuccggu 24 0 

ccucucguac uaggagcagc cccccucagu ucuccagcgc ccacggcaga uagggaccga 300 

acugucucac gacguucuaa acccagcucg cguaccacuu uaaauggcga acagccauac 360 

ccuugggacc uacuucagcc ccaggaugug augagccgac aucgaggugc caaacaccgc 420 

cgucgauaug aacucuuggg cgguaucagc cuguuauccc cggaguaccu uuuauccguu 480 

gagcgauggc ccuuccauuc agaaccaccg gaucacuaug accugcuuuc gcaccugcuc 540 

gcgccgucac gcucgcaguc aagcuggcuu augccauugc acuaaccucc ugauguccga 600 

ccaggauuag ccaaccuucg ugcuccuccg uuacucuuua ggaggagacc gccccaguca 660 

aacuacccac cagacacugu ccgcaacccg gauuacgggu caacguuaga acaucaaaca 720 

uuaaagggug guauuucaag gucggcucca ugcagacugg cguccacacu ucaaagccuc 780 

ccaccuaucc uacacaucaa ggcucaaugu ucagugucaa gcuauaguaa agguucacgg 84 0 

ggucuuuccg ucuugccgcg gguacacugc aucuucacag cgaguucaau uucacugagu 900 

cucgggugga gacagccugg ccaucauuac gccauucgug caggucggaa cuuacccgac 960 

aaggaauuuc gcuaccuuag gaccguuaua guuacggccg ccguuuaccg gggcuucgau 102 0 

caagagcuuc gcuugcgcua accccaucaa uuaaccuucc ggcaccgggc aggcgucaca 1080 

ccguauacgu ccacuuucgu guuugcacag ugcuguguuu uuaauaaaca guugcagcca 1140 

gcugguaucu ucgacugauu ucagcuccau ccgcgaggga ccucaccuac auaucagcgu 1200 

gccuucuccc gaaguuacgg caccauuuug ccuaguuccu ucacccgagu ucucucaagc 1260 

gccuugguau ucucuaccug accaccugug ucgguuuggg guacgauuug auguuaccug 1320 

augcuuagag gcuuuuccug gaagcagggc auuuguugcu ucagcaccgu agugccucgu 1380 

caucacgccu cagccuugau uuuccggauu ugccuggaaa accagccuac acgcuuaaac 144 0 

cgggacaacc gucgcccggc caacauagcc uucuccgucc ccccuucgca guaacaccaa 1500 

guacaggaau auuaaccugu uucccaucga cuacgccuuu cggccucgcc uuaggggucg 1560 

acucacccug ccccgauuaa cguuggacag gaacccuugg ucuuccggcg agcgggcuuu 1620 

ucacccgcuu uaucguuacu uaugucagca uucgcacuuc ugauaccucc agcaugccuc 1680 

acagcacacc uucgcaggcu uacagaacgc uccccuaccc aacaacgcau aagcgucgcu 1740 

gccgcagcuu cggugcaugg uuuagccccg uuacaucuuc cgcgcaggcc gacucgacca 18 00 

gugagcuauu acgcuuucuu uaaaugaugg cugcuucuaa gccaacaucc uggcugucug 18 60 



-218- 



uaucucccgg 
auuagucggu 
caccggguuu 
ggcuccccua 
guacgcaguc 
uuuucacucc 
gucagucagg 
ugucccgccc 
uguaucgcgc 
gcuccccguu 
uuagauguuu 
ugucgaaaca 
uuaccgacgc 
ccguguacgc 



ggccuuccca 
uuguuucccu 
ccgguauucg 



cucuaccccc 



caucguuucc 
cuucacgacg 
caguuugcau 
ggagaugaau 
uuugauuggc 
ucgguccucc 
cgggucuaua 
uucgguuaac 
acacgccuaa 
ccucgccggg 
aguauuuagc 
uacucaucga 
gccuuuccag 
cgcucgccgc 
caguuccccc 
cacuggguuu 
uuaucgcaga 
uuagucgcuu 



cacuuaacca ugacuuuggg accuuagcug gcggucuggg 
gacguuagca cccgccgugu gucucccgug auaacauucu 
cggguuggua ' agucgggaug acccccuugc cgaaacagug 
ucacgaggcg cuaccuaaau agcuuucggg gagaaccagc 
cuuucacccc cagccacaag ucauccgcua auuuuucaac 
aguuaguguu acccaaccuu caaccugccc auggcuagau 
cccugcaacu uaacgcccag uuaagacucg guuucccuuc 
cuugcuacag aauauaaguc gcugacccau uauacaaaag 
gcgugcuccc acugcuugua cguacacggu uucagguucu 
guucuuuucg ccuuucccuc acgguacugg uucacuaucg 
cuuggaggau ggucccccca uauucagaca ggauaccacg 
gcucacagca ugugcauuuu uguguacggg gcugucaccc 
acgcuuccac uaacacacac acugauucag gcucugggcu 
uacuggggga aucucgguug auuucuuuuc cucgggguac 
gguucgccuc auuaaccuau ggauucaguu aaugauagug 
ccccauucgg aaaucgccgg uuauaacggu ucauaucacc 
uuagcacguc cuucaucgcc ucugacugcc agggcaucca 



aacc 



1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2894 



<210> 400 
<211> 120 
<212> RNA 
<213> E. Coli 

<400> 400 

augccuggca guucccuacu cucgcauggg gagaccccac acuaccaucg gcgcuacggc 60 
guuucacuuc ugaguucggc auggggucag gugggaccac cgcgcuacgg ccgccaggca 120 



<210> 401 

<211> 76 

<212> RNA 

<213> E. Coli 

<400> 401 

guccccuucg ucuagaggcc caggacaccg cccuuucacg gcgguaacag ggguucgaau 60 
ccccuagggg acgcca 7 6 



<210> 402 
<211> 1549 
<212> RNA 
<213> E. Coli 



<400> 402 



aaauugaaga guuugaucau 
gucgaacggu aacaggaagc 
ugucugggaa gcugccugau 
aaugucgcaa gaccaaagag 
ggauuagcuu guuggugggg 
ggaugaccag ccacacugga 
ggaauauugc acaaugggcg 
ucggguugua aaguacuuuc 



ggcucagauu 
agcuugcugc 
ggagggggau 
ggggaccuuc 
uaacggcuca 
acugagacac 
caagccugau 
agcggggagg 



gaacgcuggc 
uucgcugacg 
aacuacugga 
gggccucuug 
ccaaggcgac 
gguccagacu 
gcagccaugc 
aagggaguaa 



ggcaggccua 
aguggcggac 
aacgguagcu 
ccaucggaug 
gaucccuagc 
ccuacgggag 
cgcguguaug 
aguuaauacc 



acacaugcaa 
gggugaguaa 
aauaccgcau 
ugcccagaug 
uggucugaga 
gcagcagugg 
aagaaggccu 
uuugcucauu 



60 
120 
180 
240 
300 
360 
420 
480 
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gacguuaccc 
ggugcaagcg 
agaugugaaa 
cguagagggg 
cgguggcgaa 
aaacaggauu 
ccuugaggcg 
aagguuaaaa 
uucgaugcaa 
gaaugugccu 
aaauguuggg 
ccgggaacuc 
caucauggcc 
gaccucgcga 
cucgacucca 
uucccgggcc 
uagcuuaacc 
acaagguaac 



gcagaagaag 
uuaaucggaa 
uccccgggcu 
gguagaauuc 
ggcggccccc 
agauacccug 
uggcuuccgg 
cucaaaugaa 
cgcgaagaac 
ucgggaaccg 
uuaagucccg 
aaaggagacu 
cuuacgacca 
gagcaagcgg 
ugaagucgga 
uuguacacac 
uucgggaggg 
cguaggggaa 



caccggcuaa 
uuacugggcg 
caaccuggga 
cagguguagc 
uggacgaaga 
guaguccacg 
agcuaacgcg 
uugacggggg 
cuuaccuggu 
ugagacaggu 
caacgagcgc 
gccagugaua 
gggcuacaca 
accucauaaa 
aucgcuagua 
cgcccgucac 
cgcuuaccac 
ccugcgguug 



cuccgugcca gcagccgcgg uaauacggag 
uaaagcgcac gcaggcgggu ugguuaaguc 
acugcaucug auacuggcaa gcuugagucu 
ggugaaaugc guagagaucu ggaggaauac 
cugacgcuca ggugcgaaag cguggggagc 
ccguaaacga ugucgacuug gagguugugc 
uuaagucgac cgccugggga guacggccgc 
cccgcacaag cgguggagca ugugguuuaa 
cuugacaucc acggaaguuu ucagagauga 
gcugcauggc ugucgucagc ucguguugug 
aacccuuauc cuuuguugcc agcgguccgg 
aacuggagga agguggggau gacgucaagu 
cgugcuacaa uggcgcauac aaagagaagc 
gugcgucgua guccggauug gagucugcaa 
aucguggauc agaaugccac ggugaauacg 
accaugggag uggguugcaa aagaaguagg 
uuugugauuc augacugggg ugaagucgua 
gaucaccucc uuaccuuaa 



540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1549 



<210> 403 

<211> 17 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer Oligonucleotide 
<400> 403 

tgtttarcag accgctt 17 

<210> 404 

<211> 18 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer Oligonucleotide 
<400> 404 

acaatttcac acagcctc 18 



<210> 405 
<211> 159 
<212> DNA 

<213> Escherichia coli 
<400> 405 

caggtggtat ggaaacccaa aatggagacg ggaagctgaa ccagatagtt actggaggtg 60 
atcaccagca gatgaaataa cgataaccag aacaacgcct tatagcgttg agtttgcgag 120 
aaaacgttca tattgtacct ttttgattaa ccattgggg 159 

<210> 406 
<211> 640 
<212> DNA 

<213> Escherichia coli 
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<220> 

<221> misc_f eature 
<222> (1) . . . (640) 
<223> n = A, T, C or G 



<400> 406 

ggggnccaaa gtgtttgggn cgggcaactg gaggccaacc ttaanttngg ggaaattttt 60 

aanaaaaggc ggggatttgt nagccacggg ngattanttt anaataaatt aagtgttgcc 120 

ataaggggac aaagngaagg aagtggntat taanggannc gccaatgcga nttagggcag 180 

accattcggc cattcgcctt cttggttatc gaagttcatc cagatagccg ttgccngacc 24 0 

gaccagattc gcttcnggca caaagcccca gtaacggctg tccgcgctgt tgtcgcggtt 300 

gtcgcccatc atgaagtatt gtcccggagg aacaatccag gttgccagtt gttgccctgg 360 

ctgctggtaa tacatcccca cctgatcctg cgcaatcggc actgtcagaa tgcggtgcgt 420 

cacatcaccc agtgtctctt tacgctcgga aagacgaatt ccattttctt tggtttcgtt 480 

tttcggcact tcaaagaatc cgctggtcgc ttccccacca ttacggcgtg agaaggtctg 54 0 

aacgaaatcg ctcggttcca cgtttgagta ggtgaccggc agcgcgtttt cacacgcctg 600 

gccggaactg catcccggtt gaatcgtcag ctcttttgag 64 0 



<210> 407 
<211> 682 
<212> DNA 

<213> Escherichia coli 
<220> 

<221> misc_f eature 
<222> (1) . . . (682) 
<223> n = A,T,C or G 

<400> 407 

cctgcagggt aatgtcgcca ttaaactggc gcaggcagcc aaagagttgc tccgcttcta 
cccagtcggc agcgacaact tgcgttaaag tcgcaaaatt atcatctgca ctcactgcgt 
gacgtaagcg gatggagtgg ccggaaacct catagtgacc gcccaccagt tggcctgcat 
cgctttgtag cgtacgcgcg gcattggcaa taagattcag atactcagac tcttccgggg 
ccttcgccag cataaaagag gaggatgctc gcgtatgcag caactgctcc agcgcaaatt 
gcagccgcgg ttgagtatca ctgaataaag gatcgttttc gtcaatcaaa tgtggctgag 
caaatatttc ctgatagcta tcggtatcag gaaccaggtc acgccatgca agtttcgtaa 
tggtcaaagt tgatgttttt tagtctgttg tcaaagccgc nattataccn gtaaccggca 
ctacagcaca cgtagaaagc acccgacaat actcctggca tgggcgttaa agctcacagg 
atggagatct tttcttcact ggcctaaaaa gctgatattc tgtaaagagt tacacngtaa 
cattgagatc gctatgaaat atcaacaact tggaaaatct tgnaaagcng gttggaaaat 
ggaaagtatc tggttaagaa gc 

<210> 408 
<211> 309 
<212> DNA 

<213> Escherichia coli 



<400> 408 

ggggatccgg cagaatttta cgctgaccaa tgacgcgacg acgtggcatg gaaatactcc 60 

gttgttaatt caggattgtc caaaactcta cgagtttagt ttgacattta agttaaaacg 120 

tttggcctta cttaacggag aaccattaag ccttaggacg cttcacgcca tacttggaac 180 

gagcctgctt acggtcttta acgccggagc agtcaagcgc accacgtacg gtgtggtaac 240 

gaacacccgg gaggtcttta acacgaccgc cacggatcag gatcacggag tgctcctgca 300 

gccaagctt 309 



<210> 409 
<211> 1167 
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60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
682 



<212> DNA 

<213> Escherichia coli 



<400> 409 

gtcgacccat 

tggaacactt 

cagaacggac 

cagcagtagt 

ctaaaaagtc 

ttaacttcaa 

ttgctcacgc 

agacccaggc 

gctttcagaa 

acagctacag 

tacaacgtcc 

catttaaatk 

taactgcgaw 

gccagccgaa 

cagagttgcc 

tttgacctca 

cgggtgttcc 

gaacggagta 

gccagcttcg 

aatccgcaac 



ctgtccattg 
tccgcaatgc 
gtcctgtcgc 
caaacagtgt 
accagccatc 
cttcagcgcc 
cttctttcag 
cagttgcgcc 
ttacgtcgaa 
cagcagcagc 
attacagaca 
gttcctgaat 
taagcagctt 
gcttctttca 
aggcggtcga 
aattttgcat 
atagagtatg 
ccttcaacag 
cgacctgctt 
tactgcaagc 



agcggacagt 
ctgttgctat 
agtatattaa 
gaaacgctac 
agcctgattt 
agcttcttcc 
agcagccggt 
acgtactgct 
ttcagttttt 
ggaaacaccg 
tagctgcaac 
atcagaataa 
ytttcgcatc 
tggttgccat 
tctgagacgc 
tcgctttcgc 
caatcagggt 
cacggcgcag 
tacgcagttc 
caagctt 



ttgtgcaaca 
cacgcttaaa 
gtcgtcgata 
tggcgcctta 
ctcaggctgc 
agagcttttt 
gcagattcta 
ttgataacag 
tcttcagcag 
aatttttctt 
tgcttcaatg 
gtttatacgt 
gcgtacagma 
caggcgtgca 
cgggatcagc 
gaactctttg 
cggaccaaca 
cagggtgtta 
agtcatttta 



ctattttgtt 
ccatttcatt 
gaaacaagca 
cagcgcaaaa 
aaccggaagg 
tcagtgcttc 
ccaggtcttt 
caactttgtt 
cttcaaccgg 
ycattgcaga 
awttgatctt 
aagcgaatgc 
gccagagtac 
attgcttctt 
tcaccttcaa 
aacagacgag 
aacgcgtctt 
cgaacaacac 
tctacagtta 



gaccggaaaa 
gcgatttaca 
ttgaaaggca 
aggctggtga 
gttggcttat 
tgcgtcgtct 
agcttctttc 
agcgccagca 
gccagcagct 
gatcaagttc 
tagtgataga 
gttaaaaaga 
gaaccagttt 
cgtaggtcgg 
aggcagcggc 
cagcagcgcc 
tcaggcactc 
gcatgtatac 
cgcccacggg 



<210> 410 
<211> 404 
<212> DNA 

<213> Escherichia coli 



<400> 410 

caacmctatt 

cttaaaccat 

tcgatagaaa 

gccttacagc 

ggctgcaacc 

cttttttcag 

attctaccag 



ttgktggacc 
ttcattgcga 
caagcattga 
gcaaaaaggc 
ggaagggttg 
tgcttctgcg 
gtctttagct 



ggaaaakgga 
tttacacaga 
aaggcacagc 
tggtgactaa 
gcttatttaa 
tcgtctttgc 
tctttcagac 



acactttccg 
acggacgtcc 
agtagtcaaa 
aaagtcacca 
cttcaacttc 
tcacgccttc 
ccaggccagt 



cawkgcctgt 
tgtcgcagta 
cagtgtgaaa 
gccatcagcc 
agcgccagct 
tttcagagca 
tgcg 



tgctatcacg 
tattaagtcg 
cgctactggc 
tgatttctca 
tcttccagag 
gccggtgcag 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1167 



60 
120 
180 
240 
300 
360 
404 



<210> 411 
<211> 152 
<212> DNA 

<213> Escherichia coli 
<400> 411 

agagcttttt tcagtgcttc tgcgtcgtct ttgctcacgc cttctttcaa gagcagcccg 60 
gtgcagattc taccaggtct ttagcttctt tcagacccag gccagttgcg ccacgtactg 120 
ctttgataac agcaactttg ttagcgccag ca 152 

<210> 412 
<211> 825 
<212> DNA 

<213> Escherichia coli 



<220> 

<221> misc_f eature 
<222> (1) . . . (825) 
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<223> n = A, T,C or G 



<400> 412 

gatccgtcga cccatctgtc cattgagcgg acagtttgtg caacactatt ttgttgaccg 60 

gaaaatggaa cactttccgc aatgcctgtt gctatcacgc ttaamccatt tcattgcgat 120 

ttacacagaa cggacgtcct gtcgcagtat attaagtcgt cgatagaaac aagcattgaa 180 

aggcacagca gtagtcaaac agtgtgaaac gctactggcg ccttacagcg caaaaaggct 240 

ggtgactaaa aagtcaccag ccatcagcct gatttctcag gctgcaaccg gaagggttgg 300 

cttatttaac ttcaacttca gcgccagctt cttccagagc ttttttcagt gcttctgcgt 360 

cgtctttgct cacgccttct ttcagagcag ccgggtgcag attctaccag gtctttagct 420 

tctttcagac ccaggccagt tgcgccacgt actgctttga taacagcaac tttgttagcg 480 

ccagcagctt tcagaattac gtcgaattca agttttttct tcagcagctt caaccgggcc 54 0 

agcagctaca gctacagcag cagcagcgga aacaccgaat ttttcttyca ttggcagaga 600 

tcaagttcta caacgtccat tacagacata gctgcaactg cttcaatgat tkgatcttwa 660 

gtgatagaca tttaaattgt tcctgaatat cagaataagt ttatacgtaa gcgaatgcgt 720 

taaaaagata actgcgatta agcagcttct ttcgcatcgc gtacagcagc cagaggtcga 780 

accagtttgc cagccgaagg ttggcttttc agcctnnncn natta 825 



<210> 413 
<211> 425 
<212> DNA 

<213> Escherichia coli 



<400> 413 

agtagtcaaa 

aaaagtcacc 

aacttcaact 

gctcacgcct 

acccaggcca 

tttcagaatt 

agctacagca 

acaac 



caggtgkgra 
agccatcarc 
tcagcgccag 
tctttcagag 
gttgcgccac 
acgtcgaatt 
gcagcagcgg 



acgctactgg 
ctgatttctc 
cttcttccag 
cagccggtgc 
gtactgcttt 
cagttttttc 
aaacacccga 



cgccttacag 


cgcaaaaagg 


ctggtgacta 


60 


aggctgcaac 


ccggaagggt 


tggcttattt 


120 


agcttttttc 


agtgcttctg 


cgtcgtcttt 


180 


agattctacc 


aggtctttag 


cttctttcag 


240 


gataacagca 


actttgttag 


cgccagcagc 


300 


ttcagcagct 


tcaaccgggc 


cagcagctac 


360 


atttttcttc 


cattgcagag 


atcaagttct 


420 








425 



<210> 414 
<211> 126 
<212> DNA 

<213> Escherichia coli 



<400> 414 

agagcttttt tcagtgcttc tgcgtcgtct ttgctcacgc cttctttcag agcagccggt 60 

gcagattcta ccaggtcttt agcttctttc agacccaggc cagttgcgcc acgtactgct 120 

ttrata 126 



<210> 415 
<211> 264 
<212> DNA 

<213> Escherichia coli 



<400> 415 

ctgcmacccg gargggttgg cttatttaac ttcaacttca gcgccagctt cttycagagc 60 

ttttttcaag tgcttctgcg tcgtctttgc tcacgccttc tttcagagca gccggtgcag 120 

attctaccag gtctttagct tctttcagac ccaggccagt tgcgccacgt actgctttga 180 

taacagcaac tttgttagcg ccagcagctt tcagaattac gtcgaattca gttttttctt 240 

cagcagcttc aaccgggcca gcag 2 64 



<210> 416 
<211> 201 
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<212> DNA 

<213> Escherichia coli 



<400> 416 



cgcataccct gcagcatcgg cccgatggag 
taggtggtgt taccggtgtt cagatccggg 
ggagagttcg gcgctttgga tttcgcaacg 
ggaccgtcga tcatcaggtc a 



atcaggtcgg cagaacgctg taccgctttg 
aagatgaaca cggtagcgcg acctgcaacc 
tcagccatta ccgcagcgtc gtactgcagc 



60 
120 
180 
201 



<210> 417 
<211> 239 
<212> DNA 

<213> Escherichia coli 
<400> 417 

aattcagcag ttgacagtgg cataaacgta actggtgact tttgcccggc atgacgccgg 60 
gcttttttta ttattccgtg acttccagcg tagtgaaggc aaacttctcg ccatcaaata 120 
gcccctgact ggttagtttt agcgcgggga tcactggcag agaaagaaac gccatctgaa 180 
taaacggctc atcgggtaac ggaccgcatt cacgggcggc ggctttcaag gcgtcaatt 239 

<210> 418 
<211> 223 
<212> DNA 

<213> Escherichia coli 
<400> 418 

ttcttttttt cgtcaacggt gtccagaatc attttattta cctcgggtac ttatgctgat 60 
ttttattatt atggggaagg tgttatttat gagtttcatt tatgccgtaa cgacaatgaa 120 
ctcgggaatt agtataagca gcgcgagaat aataatcatt gtgcaaatgc taatttaatt 180 
aatactattt aaatattatt ttgagcatat gcacataagg ttg 223 

<210> 419 
<211> 223 
<212> DNA 

<213> Escherichia coli 
<400> 419 

ttcttttttt cgtcaacggt gtccagaatc attttattta cctcgggtac ttatgctgat 60 
ttttattatt atggggaagg tgttatttat gagtttcatt tatgccgtaa cgacaatgaa 120 
ctcgggaatt agtataagca gcgcgagaat aataatcatt gtgcaaatgc taatttaatt 180 
aatactattt aaatattatt ttgagcatat gcacataagg ttg 223 

<210> 420 
<211> 212 
<212> DNA 

<213> Escherichia coli 
<400> 420 

aatagcgggt atgcacgcct ttcttttttt cgtcaacggt gtccagaatc attttattta 60 
cctcgggtac ttatgctgat ttttattatt atggggaagg tgttatttat gagtttcatt 120 
tatgccgtaa cgmcaatgaa ctcgggaatt agtataagca gcgcgagaat aataatcatt 180 
gtgcaaatgc taatttaatt aatactattt aa 212 

<210> 421 
<211> 438 
<212> DNA 

<213> Escherichia coli 
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<400> 421 

ccctgtaaat 

caggccataa 

cacagcaaca 
atctttagcc 
gtcgcgacac 
gcgcacattc 
aagcgttcgs 



tatcgcccgt 
atgccaccag 
agagctgttt 
tgatgcctct 
acgcgaccag 
aagcaacagg 
cgggtaatcg 
gcccatkg 



ggcataaaaa 
aattatcgtc 
cagttcggcg 
gtacaaccct 
gacaccgttg 
tagtgaatat 
aaccccactt 



ctgcgtccaa 


acgccgtctt 


tgccagcagc 


60 


aaccaaccaa 


ttgctgaaac 


gccaagcagc 


120 


ggtaaccctt 


caatccattt 


gccgccagtc 


180 


aacgtgccaa 


gggtggcaac 


aatggcaggg 


240 


aaaaatcccg 


cgagcaaacc 


aagcagtaaa 


300 


cctgcgttca 


gtaacatccc 


caacagcacc 


360 


gaaacatcaa 


tattgsgsgt 


aagcattwcc 


420 
438 



<210> 422 
<211> 682 
<212> DNA 

<213> Escherichia coli 



<400> 422 

aattcccggg 

cggtaagtat 

tacgcttaag 

gcggtatcag 

tggtgaatgc 

ttctctgtat 

catcacaaaa 

gtctgaaggt 

cattaacaag 

ggctcgttta 

ctgacagttt 

atgcaggagt 



gatccgtcga 
ggcggggtta 
tgacaacccc 
ttttactccg 
ggctgagcgc 
ccggcgttaa 
ttcattgttg 
tgttttgaaa 
agaaaacaag 
cgtctgatgc 
ttcttgtttc 
gatgactgca 



ccgtgcgctt 
ttccttcccc 
gctgcaacgc 
tgactgctct 
acgcggaaca 
ttgttaactg 
aggacgcgat 
ttggtgtcac 
aacgggagct 
caaaaggatg 
cggagagcca 
go 



ccggttgtgg 


caacccgcga 


aatggcgcgg 


60 


gttgaggaca 


ccgggttgtc 


aggttgacca 


120 


cctctgttat 


caattttctg 


gtgacgtttg 


180 


gccgcccttt 


ttaaagtgaa 


ttttgtgatg 


240 


gttaaaacca 


aaaacagtgt 


tatgggtgga 


300 


gttaacgtca 


cctggaggca 


ccaggcactg 


360 


aatgaaaacg 


ttattaccaa 


acgttaatac 


420 


tatcagtaac 


ccagtattta 


ctgaagatgc 


480 


attaaataaa 


atatgcattg 


tttcaatgct 


540 


tgcacaatga 


attcagcatt 


tgtgcttgtt 


600 


gttgatattg 


cagtcagtgt 


tcacaggaca 


660 
682 



<210> 423 
<211> 600 
<212> DNA 

<213> Escherichia coli 



<400> 423 

ggggatccga 

tgcggctgag 

tatccggcgt 

aaattcattg 

ggttgttttg 

aagagaaaac 

ttacgtctga 

tttttcttgt 

agtgtatgac 

ttattcacca 



ttgtgactgc 
cgcacgcgga 
taattgttaa 
ttgaggacgc 
aaattggtgt 
aagaacggga 
tgccaaaagg 
ttccggagag 
tgcagcaacc 
ggataatatc 



tctgccgccc 
acagttaaaa 
ctggttaacg 
gataatgaaa 
cactatcagt 
gctattaaat 
atgtgcacaa 
ccagttgata 
gaacagaaaa 
gaaatcccgg 



tttttaaagt 
ccaaaaacag 
tcacctggag 
acgttattac 
aacccagtat 
aaaatatgca 
tgaattcagc 
ttgcagtcag 
ttcccggtaa 
caggtcttta 



gaattttgtg 
tgttatgggt 
gcaccaggca 
caaacgttaa 
ttactgaaga 
ttgtttcaat 
atttgtgctt 
tgttcacagg 
ctgttacccg 
aacagttccg 



atgtggtgaa 
ggattctctg 
ctgcatcaca 
tacgtctgaa 
tgccattaac 
gctggctcgt 
gttctgacag 
acaatgcagg 
gtcgataaag 
taataaataa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 



<210> 424 
<211> 100 
<212> DNA 

<213> Escherichia coli 
<400> 424 

gggatccagc aagaagatgc ggttgtaccg tcatcacgca gatgcgcaaa gctactcagc 60 
aactgacctt tcttcgcaat aagcacgcca ttagcgtcat 100 



<210> 425 



"225- 



<211> 465 
<212> DNA 

<213> Escherichia coli 



<400> 425 

tcgcgtgttt accttcaaca tcggtaactt tctggcggat agtttcacgg taagcaacct 60 

gcggtttacc tacgttcgct tcaacgttga attcacgctt catacggtca acgatgatgt 120 

cgaggtgcag ttcgcccata cccgcgatga tggtctggtt agattcttcg tcagtccata 180 

cacggaaaga cgggtcttct ttagccagac ggcccagagc cagacccatt ttttcctggt 24 0 

cagctttggt tttcggttca actgcgatgg agattaccgg ctcagggaat tccatacgtt 300 

ccagaatgat cggcgcatcc gggtcacaca gggtgtcacc agtggttacg tctttcagac 360 

cgatagcagc agcgatgtcg cccgcgcgaa cttctttgat ctcttcacgt ttgttagcgt 420 

gcatctgaac gatacgaccg aaacgctcac gtgcagcttt cacgg 4 65 



<210> 426 
<211> 653 
<212> DNA 

<213> Escherichia coli 
<220> 

<221> misc_feature 
<222> (1) . . . (653) 
<223> n = A,T,C or G 



<400> 426 

tgatcggctc aagcagaact ggtttcgctt tcttaaagcc ttctttaaag gcgatagaag 60 

cagccagttt aaacgccagt tcagaggagt caacgtcatg gtaagaaccg aagtgcagac 120 

gaatacccat gtctactacc gggtagcctg ccagcggacc tgctttcagc tgttcctgga 180 

tacctttatc aacggccggg atgtattcgc cagggattac accaccttta atgtcgttga 240 

tgaactcgta gcctttcggg tttgaacccg gctccagcgg gtacatgtcg ataacaacat 300 

gaccatactg accacgacca ccagactgtt tcgcgtgttt accttcaaca tcggtaactt 360 

tctggcggat agtttcacgg taagcaacct gcggtttacc tacgttcgct tcaacgttga 420 

attcacgctt catacggtca acgatgatgt cgaggtgcag ttcgccatac ccgcgatgat 480 

ggctgggtag attcttcgtc agtccataca cggnaagacg ggtcttnttt agccagacgg 540 

gccagagnca gacccatttt tttctggcag ctttggnttc ggtcaactgc gatggaaata 600 

cccggctcaa ggaattcata cgtttcanaa tgatcggggc attccgggtc aca 653 



<210> 427 
<211> 268 
<212> DNA 

<213> Escherichia coli 



<400> 427 

ctttcttaaa gccttcttta aaggcgatag aagcagccag tttaaacgcc agttcagagg 60 

agtcaacgtc atggtaagaa ccgaagtgca gacgaatacc catgtctact accgggtagc 120 

ctgccagcgg acctgctttc agctgttcct ggataccttt atcaacggcc gggatgtatt 180 

cgccagggat tacaccacct ttaatgtcgt tgatgaactc gtagcctttc gggtttgaac 240 

ccggctccag cgggtacatg tcgataac 268 



<210> 428 
<211> 330 
<212> DNA 

<213> Escherichia coli 
<400> 428 

gttttgggga gatgtaaggg ctaatctgaa tggctgcatt ccttgtttaa ggaaaaacga 60 
atgactgatt gccgatacct gattaaacgg gtcatcaaaa tcatcattgc tgttttacag 120 
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ctgatccttc tgttcttata acacaaggaa acgtacttaa ggtgcgtccg gtgaaccagt 180 

cggacgcacc tttaataact ataaataagt gtctgggcag atactatata aattaactta 240 

gtgaatgatt atgctaatgt catcaattaa ataaatataa tggcgttaag gcttcccagt 300 

aatataatta atactctact tccagagtag 330 

<210> 429 
<211> 465 
<212> DNA 

<213> Escherichia coli 



<400> 429 

gttttgggga 

atgactgatt 

ctgatccttc 

cggacgcacc 

gtgaatgatt 

aatataatta 

gcacaaattt 

tattggggat 



gatgtaaggg 
gccgatacct 
tgttcttata 
tttaataact 
atgctaatgt 
atactctact 
atcccacaac 
cgggcacacc 



ctaatctgaa 
gattaaacgg 
acacaaggaa 
ataaataagt 
catcaattaa 
tccagagtag 
tgttcttctg 
ttctggcagg 



tggctgcatt 
gtcatcaaaa 
acgtacttaa 
gtctgggcag 
ataaatataa 
aatattaaat 
tctcgacatg 
ttggtgtctc 



ccttgtttaa 
tcatcattgc 
ggtgcgtccg 
atactatata 
tggcgttaag 
tttatccgcg 
cgccggatct 
gtagt 



ggaaaaacga 
tgttttacag 
gtgaaccagt 
aattaactta 
gcttcccagt 
tggtgcatca 
ttcacaatag 



60 
120 
180 
240 
300 
360 
420 
465 



<210> 430 
<211> 379 
<212> DNA 

<213> Escherichia coli 



<400> 430 

aatctgaatg 

ttaaacgggt 

acaaggaaac 

aaataagtgt 

tcaattaaat 

cagagtagaa 

ttcttctgtc 



gctgcattcc 
catcaaaatc 
gtacttaagg 
ctgggcagat 
aaatataatg 
tattaaattt 
tcgacatgc 



ttgtttaagg 
atcattgctg 
tgcgtccggt 
actatataaa 
gcgttaaggc 
tatccgcgtg 



aaaaacgaat 
ttttacagct 
gaaccagtcg 
ttaacttagt 
ttcccagtaa 
gtgcatcagc 



gactgattgc 
gatccttctg 
gacgcacctt 
gaatgattat 
tataattaat 
acaaatttat 



cgatacctga 
ttcttataac 
taataactat 
gctaatgtca 
actctacttc 
cccacaactg 



60 
120 
180 
240 
300 
360 
379 



<210> 431 
<211> 443 
<212> DNA 

<213> Escherichia coli 



<400> 431 

aagatgatgt 

aaaaaacact 

attagtttat 

tgggtatcaa 

aatcccattg 

aaaccaatat 

ccctgcatac 

agataaatgt 



gatgagaaag 
aaatcaaaaa 
ttcaaatgag 
ataaacaaca 
cctggcggat 
cacgcagcag 
cagccagcat 
actgcccgat 



tcaatttgaa 
ataatggcat 
gaaaatctcc 
gaggagaaat 
aagttgcggc 
ttttttcgcc 
caacgccgca 

gtc 



taagacaata 
tagaaaatat 
cggcgaaaaa 
ttttaacgca 
ttaacgccag 
ggattggtac 
ctgtgcttgc 



ttaagagcta 
aatgcgaaaa 
accgggagat 
gccattcagg 
gaagcgtgtc 
cggaaaacag 
ggctacgctc 



aaaaaatgtc 
cggaggtgaa 
gaaagtgtga 
caaatcgttt 
ggccagtttc 
atcgcggaat 
atagcgacgc 



60 
120 
180 
240 
300 
360 
420 
443 



<210> 432 
<211> 638 
<212> DNA 

<213> Escherichia coli 



<400> 432 

cagggggttt gttgtgggca atgatgcatt taagttatcg tctgcagata gaggagatat 60 
tacaataaac aacgaatcag ggcatttgat agtcaatacc gcaattctat caggagatat 120 
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agtcactcta 
cgcgcaattt 
taaattaaaa 
taaatgcttg 
tccaaaaaga 
agcaataaaa 
gttagtaata 
atggaaataa 
ggtgcttggt 



agaggaggag 
aaacttagtg 
aaacgactgt 
cttaaatctc 
gatactacaa 
aataataaca 
gtcgttagta 
ttttctttat 
ttgctaacca 



aaattaggtt 
ctttacatcg 
tatgtataag 
tatatccttc 
ataaagatgc 
atgatataaa 
tgtttgattc 
actgagacat 
ataaaaagag 



ggtattatag 
ctattgtctt 
caaaggtccg 
cccgaaaaat 
ctttatttta 
tctaatgttt 
tccatatatt 
cacaccatca 
tgcattcg 



cttgtgcgcg 
gatttctttg 
aacgaaaaat 
gacacataaa 
ttatttctaa 
ttaaatatat 
acgtgtagtt 
tcaaatggaa 



ccatgattgg 
aattatttta 
acattccaaa 
attgagatat 
taaaaataga 
tgtcttttat 
ttttatatac 
gtttgaagat 



<210> 433 
<211> 299 
<212> DNA 

<213> Escherichia coli 



<400> 433 

ctttacctgg 

acagaatgcc 

aaagcatagc 

cgatggcgca 

ttcgttttcc 



catgatccac 
cattgtttcc 
acaacatcgg 
gaaattgcgc 
gtgtcataag 



ttcgccagaa 
tcacttatct 
gagggcaaga 
catcaacgat 
aacgtacggt 



taccggcaat 
gttttgcatt 
tttgtgacga 
cagtgataat 
attcaccaga 



aagcccaaaa 
agcgggttag 
gcatcacgga 
taccaaccac 
tcttttatca 



ataatccatg 
tcgctgataa 
ggtttttttg 
aaacatcatg 
cttcagccg 



<210> 434 
<211> 388 
<212> DNA 

<213> Escherichia coli 



180 
240 
300 
360 
420 
480 
540 
600 
638 



60 
120 
180 
240 
299 



<400> 434 

gaaaaaggag 

ttattgtgct 

atcgacctga 

cgcggcatcg 

gcgcacgcta 

atgaatcgcg 

cacgcccagc 



gcaatatcgg 
gcaggtgttt 
tcatcaaact 
gctttcatca 
tataccggcc 
ttatcaatcg 
gacacagaac 



gtaaaggcat 
tagcgggttg 
gaatagcggc 
tccgcaccat 
ccagtttacg 
ctgccttacg 
gaattccc 



tagcccgacg 
ttgatccaca 
ctgctcgtaa 
tgggctgggc 
atgaaagccg 
cgctttgtct 



aatacgtcgg 
ggttctaact 
gtttcctggg 
tgatagttgg 
ttcgccagtt 
ttataggcat 



gctacaaata 
ggaagaccac 
cggacaccgg 
aaacatggta 
cctgcgcctg 
ccggctgcgc 



60 
120 
180 
240 
300 
360 
388 



<210> 435 
<211> 351 
<212> DNA 

<213> Escherichia coli 



<400> 435 

ctatccttga 

acagaaaaag 

atattattgt 

cacatcgacc 

cggcgcggca 

gtagcgcacg 



tgaaaccgcg agcaaagata ggtgattacg tcatggtttt acagaaaatt 60 

gaggcaatat cgggtaaagg cattagcccg acgaatacgt cgggctacaa 120 

gctgcaggtg ttttagcggg ttgttgatcc acaggttcta actggaagac 180 

tgatcatcaa actgaatagc ggcctgctcg taagtttcct gggcggacac 240 

tcggctttca tcatccgcac cattgggctg ggctgatagt tggaaacatg 300 

ctatataccg gccccagttt acgatgaaag ccgttcgcca g 351 



<210> 436 
<211> 762 
<212> DNA 

<213> Escherichia coli 



<220> 

<221> misc feature 
<222> (1) .7. (762) 
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<223> n = A,T,C or G 



<400> 436 

aattatgaaa cactgtctgg aatcgtctga atgacgggca catttgcgag cacgcatcca 60 

gtaataacac aggaaactat tttatctacg cgttagcgat agactgcttg catggcgaaa 120 

ggaggtaagc cgacgatttc agcgggacgc tgaaacggga aagcccctcc cgaggaaggg 18 0 

gccataaata aggaaagggt catgatgaag ctactcatca tcgtggtgct cttagtcata 240 

agcttccccg cttactaaga ctaccagggc gggggaaacc ccgctctacc ctcactcctg 300 

aaagtatgcc ttcacgataa gattgtcaat ccgcaggctt tgtagtctgc gatcctgcca 360 

gcaaatattc tttgcgagtc gttacgcaat aatcacagag gaaactattt tattcacgcg 420 

ttagcgatag actgcattca gggcgaaagg aggtaagccg atgatttcag cgggacgctg 480 

aaacgggaaa gcctctcccg gagaagaggg cttttaataa ggaaagggtt atgatgaagc 54 0 

acgtcatcat actggtgata ctcttagtga ttagcttcca ggcttactaa gaacaccagg 600 

gggaggggga aacctcttcc taaccctcac ttctgaaatt gggtgctatg acgctggcgt 660 

tactgcttan cgctaccagt ttgtctgccc tggcggttgt aacgccagat cggtacccgt 720 

ttggatattt taatgaaagc cgacaaatca atcancgtga eg 762 



<210> 437 
<211> 292 
<212> DNA 

<213> Escherichia coli 
<400> 437 

cacatttgcg agcacgcatc cagtaataac acaggaaact attttatcta cgcgttagcg 60 
atagactgct tgcatggcga aaggaggtaa gccgacgatt tcagcgggac gctgaaacgg 120 
gaaagcccct cccgaggaag gggccataaa taaggaaagg gtcatgatga agctactcat 180 
catcgtggtg ctcttagtca taagcttccc cgcttactaa gactaccagg gcgggggaaa 240 
ccccgctcta ccctcactcc tgaaagtatg ccttcacgat aagattgtca at 292 

<210> 438 
<211> 631 
<212> DNA 

<213> Escherichia coli 
<400> 438 

atttacactt tttacgaaat catgggatca ctaacaaaat atcgcttgtc agttatattg 60 

tatggcagga aagatatgcg actgatatta cagatcccca aagtggagag tttatgacca 120 

ttaaaaataa gatgttgctg ggtgcgcttt tgctggttac cagtgccgcc tgggccgcac 180 

cagccaccgc gggttcgacc aatacctcgg gaatttctaa gtatgagtta agtagtttca 240 

ttgctgactt taagcatttc aaaccagggg acaccgtacc agaaatgtac cgtaccgatg 300 

agtacaacat taagcagtgg cagttgcgta acctgcccgc gcctgatgcc gggacgcact 360 

ggacctatat gggtggcgcg tacgtgttga tcagcgacac cgacggtaaa atcattaaag 420 

cctacgacgg tgagattttt tatcatcgct aaaaaaagcc ccctcatcat gagggggaaa 480 

tgcagacacc ttgttatttt ttattattag ccacttgctc gtcttgcttg ttattagtcg 540 

tatttcacgt tgattaatgc ggttgcctcc agtgcgccag atttaacttt gtttgtatcg 600 

tagacgtagt aactggctgt tatcggaatt g 631 

<210> 439 
<211> 566 
<212> DNA 

<213> Escherichia coli 
<400> 439 

tatggcagga aagatatgcg actgatatta cagatcccca aagtggagag tttatgacca 60 
ttaaaaataa gatgttgctg ggtgcgcttt tgctggttac cagtgccgcc tgggccgcac 120 
cagccaccgc gggttcgacc aatacctcgg gaatttctaa gtatgagtta agtagtttca 180 
ttgctgactt taagcatttc aaaccagggg acaccgtacc agaaatgtac cgtaccgatg 240 
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agtacaacat 
ggacctatat 
cctacgacgg 
tgcagacacc 
tatttcacgt 
tagacgtagt 



taagcagtgg 
gggtggcgcg 
tgagattttt 
ttgttatttt 
tgattaatgc 
aactggctgt 



cagttgcgta 
tacgtgttga 
tatcatcgct 
ttattattag 
ggttgcctcc 
atcgaa 



acctgcccgc 
tcagcgacac 
aaaaaaagcc 
ccacttgctc 
agtgcgccag 



gcctgatgcc 
cgacggtaaa 
ccctcatcat 
gtcttgcttg 
atttaacttt 



gggacgcact 
atcattaaag 
gagggggaaa 
ttattagtcg 
gtttgtatcg 



300 
360 
420 
480 
540 
566 



<210> 440 
<211> 339 
<212> DNA 

<213> Escherichia coli 



<400> 440 

cgtattcaca 

atcttcatta 

ggagcataaa 

tcggcactga 

tgcagccttt 

aatcgcacca 



tccttttgat 
cagcggtcgt 
gatgaaaaaa 
gctgggatgg 
gtgcaattta 
ataacggcaa 



tggtgataac 
atttttagca 
acaacgatta 
tggtaacgtc 
agcgttaact 
ccacgaagct 



atgcgaatcg 
tggtttttta 
ttatgatggg 
acctctaaaa 
tttaatcttc 
gccaaaatt 



gtattatttt 
ttggcggcta 
tgtggcgatt 
aatagcaaag 
ctgtagataa 



tccggttgta 
tgctgccccg 
attgtcgtac 
gctgcctgtg 
atagcacgac 



<210> 441 
<211> 376 
<212> DNA 

<213> Escherichia coli 



<400> 441 

catgaatatt 

cataaaatat 

acaaagaagc 

cttttaattt 

catacataaa 

gtccatctct 

ttaaaacaac 



taaaaaggaa 
ttccgtggag 
attgaatgca 
aacatgtaaa 
tggagtcatg 
aattgcatat 
tgtaca 



aacgacatga 
cattttatta 
gggaaaaata 
cgcatggtta 
ttttcccttt 
tttaattttt 



aaccgaagca 
ttgaatatag 
atatggccat 
atcctcatat 
tccatttatc 
ctgataaatg 



<210> 442 
<211> 446 
<212> DNA 

<213> Escherichia coli 



cagaatcaac 
aggtttaact 
aaaaaacatc 
cacgggtgga 
aagttcctgt 
gcattgagca 



attctccaat 
ccggtaaaaa 
gaaagaaact 
gtgttaagaa 
tgccgtttta 
tcgatttcat 



60 
120 
180 
240 
300 
339 



60 
120 
180 
240 
300 
360 
376 



<400> 442 

ttacgatagc 

tgactacctt 

tatttaaaaa 

atatttccgt 

aagcattgaa 

atttaacatg 

taaatggagt 

ctctaattgc 



tattagtaaa 
cgtttttttg 
ggaaaacgac 
ggagcatttt 
tgcagggaaa 
taaacgcatg 
catgttttcc 
atattttaat 



aatataagag 
attaagaatg 
atgaaaccga 
attattgaat 
aataatatgg 
gttaatcctc 
cttttccatt 
ttttct 



<210> 443 
<211> 388 
<212> DNA 

<213> Escherichia coli 



ttagctgtat 
attttattat 
agcacagaat 
atagaggttt 
ccataaaaaa 
atatcacggg 
tatcaagttc 



tgttatgtct 
cgtaagtaaa 
caacattctc 
aactccggta 
catcgaaaga 
tggagtgtta 
ctgttgccgt 



gtggcgaaat 
attacatgaa 
caatcataaa 
aaaaacaaag 
aactctttta 
agaacataca 
tttagtccat 



60 
120 
180 
240 
300 
360 
420 
446 



<220> 

<221> misc_feature 
<222> (1) . . . (388) 
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<223> n - A,T,C or G 



<400> 443 

tcaccccggt 

ggaaaacaaa 

cggactttta 

tcacaatagg 

tcgcatcttt 

ccgctttttc 

cncgcttttc 



gccgattttc 
gagataaatg 
cccgcctggt 
ctgtgtagcc 
atctttggct 
gcgctccaga 
ttcctgacga 



aggcatcctg 
tctaatcctg 
ttattaattg 
tgggcctgtt 
gcaggttgaa 
cgagcctggc 
atagccgc 



atttaactta 


gcacccgcaa 


cttaactaca 


60 


atgcaaatcg 


agccgatttt 


ttaatcttta 


120 


cactgtnatc 


cgggcgttcg 


cccgctttaa 


180 


tctctttcac 


ccgcgccaga 


gcggcagcaa 


240 


cggctgcgct 


cttatgtcgt 


tcaaggcgag 


300 


gcgcttcgaa 


acgcgctttg 


gcttctgcgg 


360 
388 



<210> 444 
<211> 209 
<212> DNA 

<213> Escherichia coli 
<400> 444 

aattttaata acgctatctg cggataaagc agaataggtg gttaacccca gacataaacc 
gaggaaaata atgttattgt atttcataat ctattgttcc ttagcgacag attgctgtct 
gctggttcag taaggtacca ggagaaactt caggaagctt gtactcgaca atacagtttg 
agtttttatc tttgccccat gaaacctgt 

<210> 445 
<211> 341 
<212> DNA 

<213> Escherichia coli 



60 
120 
180 
209 



<400> 445 

catcctcaat 

acgtaatctg 

tgtttttttg 

cgaggacgcg 

accccagaca 

cgacagattg 



accgttaaat 
aaaagggacg 
gagatctacg 
cccgctaatt 
taaaccgagg 
ctgtctgctg 



gcaacccgaa 
gctggacttg 
gtaaaattaa 
ttaataacgc 
aaaataatgt 
gttcagtaag 



cccccgttgt 


ccctttgctg 


cattcactta 


60 


tgctaccggt 


cgttggaaat 


tgtctggcac 


120 


gcgaatccga 


tgagactgtg 


cagccataat 


180 


tatctgcgga 


taaagcagaa 


taggtggtta 


240 


tattgtattt 


cataatctat 


tgttccttag 


300 


gtaccaggag 


a 




341 



<210> 446 
<211> 697 
<212> DNA 

<213> Escherichia coli 



<400> 446 

agatttactg 

acgaatcmcg 

cgggcgctga 

caggatcgtt 

tctgaatacc 

cggcaatcgc 

tttatgcgtc 

aacacgtaat 

atccagagat 

cgtgktcccc 

caatgctttt 

gataacatcc 

<210> 447 
<211> 215 
<212> DNA 



ccaatttccg 
ggctataccg 
ttcttcccac 
gagcagcaga 
cgccagaaaa 
aatcattttt 
ataacttcac 
tcattgcccc 
gggaatactg 
agagccacca 
tgagttatca 
gttgccagat 



gcagatcgga 
ccaggcatgg 
gcggttattt 
tgatctacca 
ggtgccacag 
tccccttagt 
gtatgtagca 
caacattgaa 
cttttaatga 
actccgtttt 
ggtgcattct 
tggcacggat 



aagggttaam 


ccatattgat 


ccataagggt 


60 


cttgagccat 


ggcattaaat 


tccgcaaatt 


120 


tggcacacac 


cagatccagc 


aaggggtttt 


180 


gttccagcgc 


ctgggtgtat 


tgttcctcgt 


240 


cagttagctt 


ttctcctgct 


tgcaagatgt 


300 


acgatgaaca 


gcggtaaaga 


aatcgtattc 


360 


cttttgcgat 


tcaaaaaaga 


ccattgctac 


420 


aacataatgc 


ttatccagat 


atttgaagtt 


480 


ctcaggtttt 


ttgaaatatc 


ccttagcaat 


540 


atgttgcggg 


tatttttccg 


cagcatcttt 


600 


tcatcacgtc 


cgtkgmcaaa 


ttggcaatat 


660 


gaattat 






697 



-231- 



<213> Escherichia coli 



<400> 447 

aattaataac ttttcgttag gcagttttgg gtgtgagttg caagagggga gactactgaa 60 

taactcaagt tttataatcg aggggaaaat ggtgatggcg ttcatagcaa aacgccctca 120 

accataaagg tcgagggcgc ttaagatgtt aaaaacccgc tatccgttaa aaaacaatgt 180 

tcaactaagg tcagtgacat tgcgctaaaa aagcg 215 

<210> 448 
<211> 395 
<212> DNA 

<213> Escherichia coli 



<400> 448 

gcattattca 

ttaaggttta 

gcattgctat 

taagggacag 

attgaatgtt 

aaattcatat 

atattgaaat 



tgagaaatgt 
attatctgtg 
aatattggtt 
gcatagagta 
gacgctatgt 
aatgcgcctt 
agtccaacta 



gtatcgtaaa 
tgtgatattt 
atcatttgct 
atgatacgta 
gtttatgagg 
tgctcatgaa 
cttctttatt 



tcaactgaaa 
tattgaatgt 
gaatggattc 
tgcataacca 
gagaggtatt 
tggatgccag 
accaa 



ttaacgcaac 
tttaaatatt 
agtcttaatg 
acatctttac 
ttcagttgat 
tatgtagtgg 



catttgttat 
gtttttattg 
agtgggtttt 
tcattatgtc 
ctggattgtt 
gaaattataa 



60 
120 
180 
240 
300 
360 
395 



<210> 449 
<211> 641 
<212> DNA 

<213> Escherichia coli 



<220> 

<221> misc_feature 
<222> (1) . . . (641) 
<223> n = A,T,C or G 



<400> 449 

ataatcaggt 

cgcgtggcaa 

atgatatctt 

ctatacttat 

attgattatt 

aaattaatta 

gcgctcctgt 

gnntgcncgn 

cgatngtcgg 

cgaaaggctn 

tcctgagnan 



aagaaaaggt 
tacatcagtg 
ccgatttatc 
tcagcactca 
ttcctgattg 
atgtcatcag 
catgtgcatt 
cgncccacct 
cgggngcctc 
attncaaaga 
gacaaatncc 



gcgcggagat 
gcaataaaac 
ttaatcgttt 
caaataaagg 
ggctactggt 
gtccgaaaat 
gcttcatata 
naccccatgc 
cccatgcnan 
ctgggccttn 
gccgggagcg 



<210> 450 
<211> 314 
<212> DNA 

<213> Escherichia coli 
<220> 

<221> misc_feature 
<222> (1) . . . (314) 
<223> n - A,T,C or G 



taccgtgtgt 
gacatatcca 
atggataacg 
aacgccaatg 
ggtgactggc 
aacgagaata 
atcactggcg 
cgaacttcag 
agtangggaa 
cntttatctg 
gatttgaacn 



tgcgatatat 
gaaaaatata 
gcaaagggct 
aaaattatac 
gtatttaaga 
tttcagtctc 
caaggagcgc 
aantgaaaac 
ntgccangcg 
atgtttgtcg 
t 



tttttagttt 
cactaagtga 
tcgttttttc 
tctgggctgt 
tgatatttta 
tcatcctgtt 
cgcaggcgna 
nccntaacnc 
ncnnattaaa 
gagaacgctc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
641 



<400> 450 

gaactacgag taagaatagc tncgaattcc cgtttatgga taacggcaaa gggcttcgtt 



60 



-232- 



ttttcctata cttattcagc actcacaaat aaaggaacgc caatgaaaat tatactctgg 120 

gctgtattga ttattttcct gattgggcta ctggtggtga ctggcgtatt taagatgata 180 

ttttaaaatt aattaatgtc atcaggtccg aaaataacga gaatatttca gtctctcatc 240 

ctgttgcgct cctgtcatgt gcattgcttc atataatcac tggcgcaagg agcgcgcagg 300 

gggntntnnt cttt 314 

<210> 451 
<211> 236 
<212> DNA 

<213> Escherichia coli 
<400> 451 

atatacacta agtgaatgat atcttccgat ttatcttaat cgtttatgga taacggcaaa 60 
gggcttcgtt ttttcctata cttattcagc actcacaaat aaaggaacgc caatgaaaat 120 
tatactctgg gctgtattga ttattttcct gattgggcta ctggtggtga ctggcgtatt 18 0 

taagatgata ttttaaaatt aattaatgtc atcaggtccg aaaataacga gaatat 236 

<210> 452 
<211> 418 
<212> DNA 

<213> Escherichia coli 



<400> 452 

cggagattac 

ataaaacgac 

atcgtttatg 

ataaaggaac 

tactggtggt 

cgaaaataac 

tcatataatc 



cgtgtgttgc 
atatccagaa 
gataacggca 
gccaatgaaa 
gactggcgta 
gagaatattt 
actggcgcaa 



gatatatttt 
aaatatacac 
aagggcttcg 
attatactct 
tttaagatga 
cagtctctca 
ggagcgcgca 



ttagtttcgc 
taagtgaatg 
ttttttccta 
gggctgtatt 
tattttaaaa 
tcctgttgcg 
gggggcggcc 



gtggcaatac 
atatcttccg 
tacttattca 
gattattttc 
ttaattaatg 
ctcctgtcat 
aatcgccgcc 



atcagtggca 
atttatctta 
gcactcacaa 
ctgattgggc 
tcatcaggtc 
gtgcattgct 
gccccctg 



60 
120 
180 
240 
300 
360 
418 



<210> 453 
<211> 551 
<212> DNA 

<213> Escherichia coli 



<400> 453 

aacaatttgc 

cgaccgccat 

cgctgcctaa 

cgctggcgga 

tgttacagga 

taactaactc 

cgtgtttgcc 

tccttttctt 

tcccctgttt 

gtaactagca 



ccatgcgctc 
tcgactgtta 
acggctggaa 
tgtcgttcgc 
ttagtggctt 
tctgtctaaa 
ctgcattgcg 
gtggtgcgcg 
gtaatacgta 

g 



ggtcatgcgc 
atgggcgaat 
gaagcgggtt 
tgatgtggtt 
tgcgcgataa 
cctctatcca 
ccttcttcac 
ttccgctatc 
attaaacagg 



tgcatcgccc 


ggccattttg 


sgcgtccccg 


60 


cttcagtact 


ggtattaggt 


ggacaacgcg 


120 


ttgcgtttcg 


ctggtacgat 


ttagaagagg 


180 


tacagcaaac 


atccgccagt 


taactcccgg 


240 


gatcgtctgg 


tgaaagtcgg 


gtcaccatca 


300 


gcatctcctg 


agcaatacgc 


agggcttctt 


360 


gtaatctgtc 


agcaatggtc 


atcaagtttc 


420 


tcaccaataa 


atgcacgaaa 


acgctgggca 


480 


gcttttagct 


gtctgtcatt 


agtgktccct 


540 








551 



<210> 454 
<211> 93 
<212> DNA 

<213> Escherichia coli 



<400> 454 

tggcatctcg gtgttgccga tcttcatgat atccagcccg ccggaaactt cttcccaaac 60 
ggttttgctg ttatccattg agtcacggaa ctg 93 



-233- 



<210> 455 
<211> 232 
<212> DNA 

<213> Escherichia coli 
<400> 455 

cgtgccgaga tgatcctgta accatcatca gttgtgaagt agtgattcac gacttcaagg 60 
cgcttttcaa aagggtattt tggctttgac atattagggg ctattccatt tcatcgtcca 120 
acaaaatggg tgcagtacat actcgttgga aatcaacaca ggaggctggg aatgccgcag 180 
aaatatagat tactttcttt aatagtgatt tgtttcacgc ttttattttt ca 232 

<210> 456 
<211> 713 
<212> DNA 

<213> Escherichia coli 
<220> 

<221> inisc_f eature 
<222> (1) . . . (713) 
<223> n - A,T,C or G 



<400> 456 

ttagnggatn 

ccagcagagc 

cacgaacgcg 

ccatgctgta 

tacgaggtac 

gagttacttt 

gtgctttggt 

ccagcagttt 

cccgagggac 

cgtctttgta 

ctgagtagga 

tggnggcagg 



naangcccac 
gcggccttct 
gtcgacttta 
gttaccacga 
agcaatagtg 
acagccgatc 
gatcagcggt 
tttgtcagcg 
ttgcatgaca 
gtaatcatgc 
caaatccgcc 
accccgccat 



ancctcgang 
tcgtcagatt 
tcgtagtcga 
ccgtcgaaag 
atcaggcgct 
ggatagccct 
ttttgaccgg 
atcgcttcac 
gaattgtagt 
agtttcgcca 
ggagccggat 
aactggcagc 



gatctaggag 
tcgcagtagt 
tttctgggaa 
acttagcgga 
caaagaactc 
gacggatttt 
agattgctgc 
caacacccat 
taaactcagt 
tcgtactact 
ttaacgttgc 
attaaattaa 



gtagaatagc 
ggtaatggta 
gatgatctgc 
caggccacgg 
ccacatgcgt 
gaagcctgca 
caggtctgct 
gttcagggtg 
catgagtttt 
ccatgtcggt 
gaacaaccgn 
gcagaaggcc 



ttcgaattcc 
atatccaaac 
tcacggacac 
aagtcacgga 
tcgccacgca 
acagatttgc 
gctgcgttat 
atcttctcga 
ttaactactt 
gaacgctctc 
cccggagggg 
ate 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
713 



<210> 457 
<211> 292 
<212> DNA 

<213> Escherichia coli 



<400> 457 

tgaacagcag 

cctttaagga 

gtttttagtt 

atcatgaatt 

tagggttata 



agatacggcc agtgcggcca atgttttttg tcctttaaac ataacagagt 60 

tatagaatag gggtatagct acgccagaat atcgtatttg attattgcta 120 

ttgcttaaaa atattgttag ttttattaaa tgcaaaacta aattattggt 180 

tgttgtatga tgaataaaat ataggggggt atagatagac gtcattttca 240 

aatgcgacta ccatgaagtt tttaattgaa agtattgggt tg 292 



<210> 458 
<211> 282 
<212> DNA 

<213> Escherichia coli 



<400> 458 

ttattaaatg caaaactaaa ttattggtat catgaatttg ttgtatgatg aataaaatat 60 

sggggggtat agatagacgt cattttcata gggttataaa tgcgactacc atgaagtttt 120 

taattgaaag tattgggttg ctgataattt gagctgttct attcttttta aatatctata 180 

taggtctgtt aatggatttt atttttacaa ttttttgtgt ttaggcatat aaaaatcaac 240 



-234- 



ccgccatatg aacggcgggt taaaatattt acaacttagc aa 



282 



<210> 459 
<211> 300 
<212> DNA 

<213> Escherichia coli 



<400> 459 

tctgcgttcc 

ggggaaggca 

ggggggcagc 

aaagggttat 

cttactaaca 



gctaaaaggt 
aaattgcctc 
aaggggctga 
gatgaagctc 
actcatcaga 



gcaaatgctc 
tgggaaagca 
aacgggaaag 
gtcatcatac 
ggggggagaa 



aggacgttgc 
ttgcgcgggg 
cccctcccga 
tggttgtgtt 
atcctccctt 



agcgttttgc 
tccggcgctc 
agaaggggcc 
gttactgtta 
acccttgttc 



gtgaccgctc 
atcaacaatc 
ttgtataagg 
agtttcccga 
ctttactcta 



60 
120 
180 
240 
300 



<210> 460 
<211> 293 
<212> DNA 

<213> Escherichia coli 



<400> 460 

cggggtccgg 

cccgaagaag 

gtgttgttac 

cccttaccct 

gtacgaagcg 



cgctcatcaa 
gggccttgta 
tgttaagttt 
tgttccttta 
agatctgtga 



caatcggggg 
taaggaaagg 
cccgacttac 
ctctaggttg 
accgctttcc 



gcagcaaggg 


gctgaaacgg 


gaaagcccct 


60 


gttatgatga 


agctcgtcat 


catactggtt 


120 


taacaactca 


tcagaggggg 


gagaaatcct 


180 


aaaaaacaac 


agcgtcaata 


ggcctgccat 


240 


ggttagcctt 


ttttatcctg 


ttg 


293 



<210> 461 
<211> 359 
<212> DNA 

<213> Escherichia coli 



<400> 461 

caacacagga 

ttcacgcttt 

caggagagtt 

gcgggaaatt 

tcttcacaaa 

ataaaaaaat 



ggctgggaat 
tatttttcac 
atgagctggc 
attcccgcct 
aataataata 
ccccgccaaa 



gccgcagaaa 
ctggatgata 
ggcgttttta 
tacttacggc 
gattttatta 
tggcagggat 



tatagattac 
agagattcac 
gcctgcaaat 
gttgcgcatt 
cgcgatcgat 
cttagattct 



tttctttaat 
tgtgtgaatt 
tgaaagagta 
ctcattgcac 
tatttatttc 
gtgcttttaa 



agtgatttgt 
gcatattaaa 
agagtcttcg 
ccaaatttat 
ctgaaaacaa 
gcagagatt 



60 
120 
180 
240 
300 
359 



<210> 462 
<211> 673 
<212> DNA 

<213> Escherichia coli 



<400> 462 

gcaacccatg 

ccatcatcag 

ggctttgaca 

ctcgttggaa 

atagtgattt 

ttgcatatta 

taagagtctt 

acccaaattt 

tcctgaaaac 

aagcagagaa 

ggtgaaggac 

gaagtgtacg 



tcctgacctg 
ttgtgaagta 
tattaggggc 
atcaacacag 
gtttcacgct 
aacaggagag 
cggcgggaaa 
attcttcaca 
aaataaaaaa 
tacaggctgg 
actttctgac 
aac 



ggttcggggg 
gtgattcacg 
tattccattt 
gaggctggga 
tttatttttc 
ttatgagctg 
ttattcccgc 
aaaataataa 
atccccgcca 
ttacgttacc 
cttcgtccag 



acaccaaaac 
acttcaaggc 
catcgtccaa 
atgccgcaga 
acctggatga 
gcggcgtttt 
cttacttacg 
tagattttat 
aatggcaggg 
agctgccggg 
agatttgtaa 



gtgccgagat 
gcttttcaaa 
caaaatgggt 
aatatagatt 
taagagattc 
tagcctgcaa 
gcgttgcgca 
tacgcgatcg 
atcttagatt 
cctttagcgc 
ccatcgttct 



gatcctgtaa 
agggtatttt 
gcagtacata 
actttcttta 
actgtgtgaa 
attgaaagag 
ttctcattgc 
attatttatt 
ctgtgctttt 
cgctttcgat 
ggatagcaga 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
673 
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<210> 463 
<211> 630 
<212> DNA 

<213> Escherichia coli 



<400> 463 

tggtggcatt 

caccacgggg 

acaacacagg 

cggctccggt 

agtaacgtgt 

caggggggaa 

cgttcaggcg 

agaccggtgc 

tgaaaatctt 

gttgttcgat 

caacggggac 



ggttgctgga 
gctaatcttg 
cggctatatg 
cattgctggc 
catgcgggcg 
actctgcggc 
aacgtacggt 
tgtttgtggc 
gccatcgaga 
gacaatcggt 
gtcggtgata 



gagagaaaac 
actctagacc 
acgttcgcag 
attcttgcca 
tcaggctgcc 
ctttttcgtt 
ttatcctggt 
aacccactgg 
acgcgaacca 
gccccctgat 
gacgggctag 



ccccgcacgt 
actcaagaat 
agctgggcat 
gtatgatcgt 
gtaatggcaa 
cttactgcgg 
attgaataac 
tgagtttttt 
ccagatcgga 
gcggtgcctt 



tgcaggtatg 
agccgcgaaa 
ggccttctgg 
gaactggctg 
tttgcgcccg 
gtaaggcacc 
tactgcattt 
ccagtcaaca 
gatagccagg 
catgccgaag 



cacctgacaa 
cgttgtcatt 
catgatttag 
aacaagcgga 
gaccaggccg 
cagtcgccgc 
gagttctcgg 
ttgtcttcgg 
aagctgctcg 
aatttcaccc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
630 



<210> 464 
<211> 391 
<212> DNA 

<213> Escherichia coli 



<400> 464 

ctcaggctgc 

gagagaatca 

aaaaaatttg 

gttgctgctt 

tggcccagcg 

caaccagcgc 

ttgtctccag 



tgattgtttt 
taaacgtggt 
agggataagg 
gagggtcgtt 
gaccagcata 
ccaggagcgc 
ccattcacga 



tttgtgcaat 
gaatgatgat 
ccggaatggc 
ttaaccagac 
ttaggatggc 
agacttagca 
cgactggcgg 



ggcgcggtat 
tgttagcaag 
tccggccaga 
gccaggcgct 
gaatcgtcca 
gcatattcca 
a 



tagcgtcgtt 
gaaaactgtc 
gggaagttaa 
ccatacgcca 
gatcgccatc 
gcgacgatcg 



gctgtcgatg 
aaaaatcttc 
ccgcgaagct 
aaaccgcgtc 
acgctactgc 
taagcgcctg 



60 
120 
180 
240 
300 
360 
391 



<210> 465 
<211> 625 
<212> DNA 

<213> Escherichia coli 



<400> 465 

aacacaccac 

gtactgggaa 

gactaatgaa 

ggcctgattt 

ggatagattt 

ttgacacagt 

acattttgtt 

gcccggtccg 

cctgctgtca 

aactggcaaa 

atgataaatt 



accataaacg 
taattttatt 
cggagataat 
ttatgacggc 
gtgttttgct 
agcatcagtt 
cgtcaataaa 
ctcttccaat 
cggtaatgtt 
ttctgacacc 
cttcgccccc 



gaggcaaata 
ttctggtttt 
ccctcaccta 
gaaaaaaaac 
tttacgctaa 
ttctcaatga 
catgcagcga 
gaccacatcc 
gatatgccct 
tgcacgacat 
gtagc 



atgctgggta 
ctggccgcgt 
accggcccct 
cgccagtaaa 
caggcatttt 
atgttaaacg 
tttcttccgg 
agaggctctt 
tcagaatgtg 
gcttcttcat 



atatgaatgt 
atttcagcca 
tgttacagtt 
ccggcggtga 
cctgcactga 
gagcttaaac 
tttgcttacc 
caggaaatgc 
tgatggcatg 
cattagccgc 



tttaatggcc 
caaatgggat 
gtgtacaagg 
atgcttgcat 
taacgaatcg 
tcggttaatc 
ctcatacatt 
gcgactcaca 
gttatcgact 
tttgacaata 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
625 



<210> 466 
<211> 623 
<212> DNA 

<213> Escherichia coli 



<400> 466 



-236- 



tgcttttgaa 
aaaacgataa 
tctttcggta 
agtggctcgt 
ctcaactgta 
ccaccgaagc 
atccataaag 
aaataacgac 
gatgtccgcc 
cacagttaaa 
aactggttgc 



tatgtgctcg 
tctgaaagag 
tcagccagag 
tagcaaaaag 
agggtattga 
ccggaggtgg 
agtcggtctt 
attgctgtgt 
ctttttaaag 
agtcatgtta 
agtcacctgg 



caatcttgag 
ccaaggtatt 
agtgagacca 
tttcaacaat 
aatgttaaca 
tgaaataaaa 
gtctgttaaa 
gtagtcctgg 
tgaattttgt 
gtccttattg 
agg 



aaggaaatgg 
tcagtataag 
aaaatgataa 
ggagtaaata 
caagctctcg 
ccgggcacaa 
tttaaatggt 
cggcatcagt 
gatgcggtga 
gtttgggtgg 



cgaccacgaa 
cattgaatgc 
tcgtatacat 
catccaacat 
ctgtaggggt 
cacgaaggcg 
gggagtgcgc 
ttttttcttg 
atgcggctaa 
gaaagccgac 



agaaaaggca 
gacagtaaac 
aagtctttcg 
atcaataact 
atagccgaga 
catttccgat 
ctccggttgt 
aagttcggct 
gcgcacgtgg 
tgtaattgtt 



<210> 467 
<211> 234 
<212> DNA 

<213> Escherichia coli 
<400> 467 

tgtttactta caagagattc atctttgtat aaataaagat aagtaattac gcataaaaca 
acaatgatta taatagcaaa aataaatatt atcatctttg atagattact tgagatagcc 
agcatcttgt aaagccttta tcgttttttt atgctctgga ttaatataat cactacatct 
atctgagcaa tctgttgttg atggacatgt caacccatgg tcatttacag ccaa 

<210> 468 
<211> 529 
<212> DNA 

<213> Escherichia coli 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
623 



60 
120 
180 
234 



<400> 468 

attagctatt 

tagttccgtt 

gctggtaatc 

atgaacaact 

tattttctgt 

ctccaactta 

ccaccgattt 

tcaggttatg 

agtggttttg 



tcggctaaaa 
gtaagttttt 
tctaaaaaga 
gtccatgatt 
gccaatatta 
ttgatagtgt 
tgagaacgac 
ccgctcaatt 
accacgaggg 



tagagactac 
ccatagcttg 
tggcatggtt 
tcgtttaaga 
tctctaattt 
tttatgttca 
agcgacttcc 
cgctgcgtat 
agcttcacgc 



atgtcttcgg 
cactgctaaa 
tacaatgatt 
atgaagagaa 
caaaaaagtt 
gataatgccc 
gtcccagccg 
atcgcttttc 
gagttattga 



tccatctcac 
tttcgaacaa 
tttgtttcct 
atcactaaac 
acttttaatg 
gatgactttg 
tgccaggtgc 
cttatcagtt 
aaaccctga 



ttaaggagtg 
ggaattttct 
tttgattatt 
gaactgaata 
tcggtaatga 
tcatgcagct 
tgcctcagat 
cgttgatgtc 



<210> 469 
<211> 261 
<212> DNA 

<213> Escherichia coli 
<400> 469 

caaagaacct tcaacatgaa aaatatccat ttgtttgcaa aaaaagatta ttaggaagga 
aattaatgca attatcgaaa attcaaaaaa tatccaaaaa tagtatactt tattccagaa 
gagttcaata taatgtttgt cttcaatttt tcttacttca gggtaatata gattgctcat 
tacattgtga gcttcatctt tatttaattt tctgttgact ccagctctcc gtgataacgg 
ttttataatt agatgcttat c 

<210> 470 
<211> 98 
<212> DNA 

<213> Escherichia coli 



60 
120 
180 
240 
300 
360 
420 
480 
529 



60 
120 
180 
240 
261 



<400> 470 
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agatgattgc cgggaacttg ttagcggcac gcaggcggcg gctcgcaccc ttaccctgct 
ctttacgtac ttctgcgttg atagtaaaca tttctttc 



60 
98 



<210> 471 
<211> 259 
<212> DNA 

<213> Escherichia coli 



<400> 471 

agcgcgaacg aagtcgatgt gctgcagctt cggtttgtac gggtgacgct gtacgtcctg 60 

agctttaact ttgatttctt taccgtcaac aacgatggtc agaacttcgc tgtagaattc 120 

agctttagct tgcatgttca tgactttgtc gtgatccagc tcgatagcca gcggcgcttc 180 

tttgccaccg tagatgattg ccgggaactt gttagcggca cgcaggcggc ggctcgcacc 24 0 

cttaccctgc tctttacgt 259 

<210> 472 
<211> 94 
<212> DNA 

<213> Escherichia coli 
<400> 472 

aaaaacggcg taaagaaagg atgcaaacat gttaataaaa actcaaattg atcccacgta 60 
tatattacgc cgcaaaatcc ttacaataaa cagg 94 

<210> 473 
<211> 174 
<212> DNA 

<213> Escherichia coli 



<400> 473 

ttaattatta aaatagtgta acgcgattat gtggttatgg gggtaaacat taaataaacc 60 

agcggggagg ggaggtaaag tgaaaaaata aaaagcggat aatcttaata agcaggccgg 120 

acagcatcgc catccggcac tgatacgagg tttatttcag ctcatcaacc atcg 174 

<210> 474 
<211> 138 
<212> DNA 

<213> Escherichia coli 



<400> 474 

ctgtaaaaac gtcaaaaaga gtgttttatc aacagaagaa tggaggtctg acagatagta 60 

gtaatgcaaa aaaatggaga cttaagttga atgaacggga gtaaagcgaa aagactatag 120 

agtgaaggag aaattccc 238 

<210> 475 
<211> 191 
<212> DNA 

<213> Escherichia coli 



<400> 475 

tttgttggct taatattcta ttgttatctt tatttataga tgtttatatt gcatgaggtg 60 

gtttttggag agaagaatga ggaagatgcg tcgagccaca gaaacgttag ctttacatat 120 

agcggaggtg atgtgaaatt aatttacaat agaaataatt tacatatcaa acagttagat 180 

gctttttgtc g 291 

<210> 476 
<211> 245 
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<212> DNA 

<213> Escherichia coli 



<400> 476 

cggccattta tacaggaaaa gcctatgtca gaacgtaaaa actcaaaatc acgccgtaat 
tatctcgtta aatgttcctg cccaaactgc acccaagagt cagaacacag tttttcaaga 
gtacaaaaag gtgccctttt gatctgccct cattgcaaca aagtattcca gacaaatctt 
aaagctgtag cctgattgat tttattagta acaagtattt tttatatttt aataatatat 
ttaaa 



60 
120 
180 
240 
245 



<210> 477 
<211> 319 
<212> DNA 

<213> Escherichia coli 



<400> 477 

aaattttcag 

tcttgaggat 

aataatttgt 

tcattaacgt 

tgaatctgga 

tcaatcacta 



gtaccttgtc 
ctttaactcc 
ctttataaat 
aatcccatat 
taccctcaca 
aaacttttt 



accatacttt 
ccacatttgg 
cgccagtgga 
atatttatca 
gtcctctgtg 



tttttctgag 


cattaatgat 


attttgagct 


60 


tggaaagtat 


tcatattaaa 


aggaaggttg 


120 


gaattagtaa 


aacgattaaa 


ttctactaaa 


180 


ttggtatgaa 


aaatatgtgc 


accatattta 


240 


tacgcatttc 


caccgatatg 


atttcttttc 


300 








319 



<210> 478 
<211> 149 
<212> DNA 

<213> Escherichia coli 
<400> 478 

gcagtgatcg aagcgatgac gaagtgtatg gaaaaatcag aaaaactcag caaatcctga 60 
tgactttcgc cggacgtcag gccgccactt cggtgcggtt acgtccggct ttctttgctt 120 
tgtaaagcgc caaatctgcc gatttcaac 149 

<210> 479 
<211> 330 
<212> DNA 

<213> Escherichia coli 



<400> 479 

gaaagtatct 

gaagcgcgta 

aactacagat 

tatttagatt 

acccacgttc 

ataggcgctg 



tcgttattga 
ccgtatctgg 
aacatttgcg 
aaattaccag 
gcggaccagt 
aagttcttgg 



catcactgga 
acaaacattt 
cgtcctttgc 
tatttcttcg 
cttgatctac 
tgaagctaaa 



aaatataact 


tgcttttcat 


tattaaactc 


60 


atcgagctta 


ccaaattcct 


gaagaggttt 


120 


agtaatgccc 


gtcaaatcct 


tgacgggcat 


180 


gagtgaagaa 


tattaccagg 


tatatttaac 


240 


gtcaccacca 


ccgaggtagt 


tagcatcggt 


300 








330 



<210> 480 
<211> 191 
<212> DNA 

<213> Escherichia coli 



<400> 480 

tttttttcca gcaacggagc aaaaggtttg cccttgtgca gctcagggtt aaccacttta 60 

actacgtggc gacgacccgg agatgtcggt ttacatttaa caactgccat tgtattactc 120 

ctccgactta ctcagcgccg ccaacgaagt ccagattctg gccttctttc agggtgacgt 180 

aagctttttt c 291 
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<210> 481 
<211> 188 
<212> DNA 

<213> Escherichia coli 
<400> 481 

tccctttaac taccagggtg ttaacgactt cgacttcgac ttcaaacagt ttctgcacag 60 
cagctttgat ttctgctttg gtcgcgtctt tagcaacttt gagtacgatg gtgttggatt 120 
tttccatcgc agtagacgct ttttcagaaa cgtgcggtgc acgcagcacc ttcagcagac 180 
gttcttca ]_gg 

<210> 482 
<211> 172 
<212> DNA 

<213> Escherichia coli 
<400> 482 

caaaggcgaa caaagcctgt gaagcccgaa ggctccacag acagtgctac ttgaaggcct 60 
tactgtttct tcttaggagc gagcaccatg atcatctggc ggccttcgat cttcgttggg 120 
aaggattcga ccactgccag ttcttgcaaa tcgtctttca cgcgattaag ca 172 

<210> 483 
<211> 266 
<212> DNA 

<213> Escherichia coli 
<400> 483 

tggagaaaac gggtgattga taaagcaatc atcgttctag gggcgttaat tgcgctgctg 60 
gaactgatcc gctttctgct tcagcttctg aactgatagc ggaaacgtaa ttaagggcta 120 
agagcacact actcttagcc ctttaacatt taacgcattg tcacgaactc ttctgccgcc 180 
gttgggtgaa tggcgacggt attgtcgaag tcttttttgg ttgcccccat cttcagcgcc 240 
accgcgaagc cctgcaacat ttcgtc 266 

<210> 484 
<211> 259 
<212> DNA 

<213> Escherichia coli 
<400> 484 

cgcaggcagc tgatggtcaa caggatgaga gaaacccaga gacaggttaa tcacattgcc 60 
tttaaccgct gcacggtaac ctacaccaac cagctgcagc 'ttcttagtga agccttcggt 120 
aacaccgata accattgagt tcagcagggc acgcgcggta ccagcctgtg cccaaccgtc 180 
tgcgtaacca tcacgcggac cgaaggtcag ggtattatct gcatgtttaa cttcaacagc 240 
atcgttgaga gtacgagtc 259 

<210> 485 
<211> 73 
<212> DNA 

<213> Escherichia coli 
<400> 485 

caggtcggaa cttacccgac aaggaatttc gctaccttag gaccgttata gttacggccg 60 
ccgtttaccg ggg -^3 
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